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Immunogenic Sequences 

The present invention relates to nucleic acid sequences, in 
particular genes that encode the enzymes which produce the O- 
5 antigen of Francisella tularensis, and their use as or in the 
production of vaccines and in diagnosis . 

Francisella tularensis is a small Gram-negative 
coccobacillus, which causes the zoonotic disease Tularemia. 
According to Bergey' s manual of systematic bacteriology the 
10 genus Francisella contains two species: F. tularensis and 

Francisella novicida . However, recently several workers have 
suggested that F. novicida be considered a subspecies of F . 
tularensis (Mollis DG, et al . , J. Clin. Micro . 21: 1601-1608). 
The closely related bacterium Yersinia philomiragia is now also 
15 considered a member of the genus Francisella, due to its high 
degree of relatedness at the DNA level- There are several 
proposed subspecies of F. tularensis other than novicida; these 
are: subspecies tularensis, subspecies holarctica and subspecies 
mediaasiatica. The subspecies tularensis and holarctica can be 
20 identified on the basis of virulence, citrulline ureidase 

activity and acid production from glycerol (Olsufjev NG, et al. 
(1959) J. Hyg. Epidemiol. Microbiol. Immunol. 3: 138-149. 
Francisella tularensis subspecies mediaasiatica is predominantly 
found in central asian republics of the former USSR. Strains of 
25 this subspecies possess citrulline ureidase activity, and are 

able to ferment glycerol, but are less virulent than strains of 
F. tularensis subspecies tularensis in rabbit. 

Tularemia is a disease occurring in the northern 
hemisphere; with cases frequently found in Europe, N. America, 
30 Asia, N. Russia and Japan. Rodents are thought to be the main 
reservoir of the bacteria, with ticks as one of the main 
vectors . 

The lipopolysaccharide (LPS) of Gram-negative bacteria is 
the major component of the outer membrane. The molecule is 
35 composed of 3 regions, lipid-A, which is embedded in the outer 
membrane and has a conserved structure between species, and two 
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polysaccharides, the core oligosaccharide which can vary in 
complexity between species, and the O-antigen which is a very 
polymorphic structure (Kenne L, et al- (1983) Bacterial 
Polysaccharides The polysaccharides. Academic Press, pp. 287- 
5 3 62) . The LPS molecule is thought to be required by the 

bacteria for protection against serum killing (Whitfield C, et 
al, (1997) Mol. Micro. 23: 629-638) and cationic antimicrobial 
peptides (Groisman EA. (1994). Trends . Microbiol . 2: 444-449). 

The structure and immunogenicity of LPS isolated from the 

10 less virulent F. tularensis subspecies holarctica strains has 
been studied to some degree (Dreisbach VC, et al. (2000) 
Infect . Immun 68: 1988-1996). Animals immunised with this LPS 
are protected against a subspecies holarctica strain challenge 
(Fulop MJ, et al. (1995). Vaccine 13: 1220-1225), but not a 

15 subspecies tularensis strain challenge (Fulop MJ, et al . (2001) 
Vaccine 19: 44 65-4472) . However, the LPS from a subspecies 
holarctica strain appears to be less toxic than other Gram- 
negative LPS and its O-antigen contains rare sugars which are 
related in structure to those found in Pseudomonas aeruginosa 0 

20 and Shigella dysenteriae type 7. 

There are no reports of LPS isolation from the more 
virulent subspecies tularensis strains. 

When LPS structure is studied in other species, it is 
frequently observed that the only difference in structure 

25 between strains is the composition of the O-antigen. Therefore 
it would be useful to elucidate the structure of the O-antigen 
part of the LPS molecule in virulent subspecies in order to 
provide the basis for diagnostic tests and also to allow it to 
be produced recombinantly , to avoid handling a pathogenic 

30 organism. 

However, the genetic basis of O-antigen expression is 
complex; in most bacteria the genes required for production of 
complete O-antigen are located in a cluster on the bacterial 
chromosome. Therefore identification and isolation of genes 
35 responsible for the O-antigen is not straightforward. 

Furthermore, the identification and isolation of LPS from 
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virulent strains is further complicated because it is difficult 
to stain using conventional methods. 

The applicants have now determined the genetic basis of O- 
antigen production in F. tularensis subspecies tularensis . 
5 Furthermore, they have established the efficacy of LPS from 
various F. tularensis strains as .a vaccine. 

According to the present invention there is provided a 
nucleic acid which encodes a series of enzymes or enzyme 
fragments which, when expressed together in a cell, are able to 

10 produce an immunogenic moiety able to produce an immune response 
in an animal to which it is administered, which response is 
protective against Francisella tularensis infection, said 
nucleic acid encoding at least some of the enzymes of SEQ ID NOS 
3-17, or modifications thereof. 

15 The expression "modification" refers to sequences of amino 

acids, which differ from the base sequence from which they are 
derived in that one or more amino acids within the sequence are 
substituted for other amino acids. Amino acid substitutions may 
be regarded as "conservative" where an amino acid is replaced 

20 with a different amino acid with broadly similar properties. 
Non-conservative substitutions are where amino acids are 
replaced with amino acids of a different type. Broadly 
speaking, fewer non-conservative substitutions will be possible 
without altering the biological activity of the polypeptide. 

25 Suitably modifications will be at least 60% indentical, 

preferably at least 75% identical, and more preferably at least 
90% identical to the base sequence. 

Identity in this instance can be judged for example using 
the algorithm of Lipman-Pearson, with Ktuple:2, gap penalty: 4, 

30 Gap Length Penalty: 12, standard PAM scoring matrix (Lipman, D.J. 
and Pearson, W.R., Rapid and Sensitive Protein Similarity 
Searches, Science, 1985, vol. 227, 1435-1441) . 

In particular, the invention comprises a nucleic acid which 
encodes enzymes of SEQ ID NOS 3-17. 



WO 03/102191 



PCT/GB03/02338 



A preferred example of such a nucleic acid comprises SEQ ID 
NO 1 or a variant thereof- In particular the nucleic acid is of 
SEQ ID NO 1. 

The term "variant thereof" in relation to a nucleic acid 
5 sequences means any substitution of, variation of, modification 
of, replacement of, deletion of, or the addition of one or more 
nucleic acid(s) from or to a polynucleotide sequence providing 
the resultant protein sequence encoded by the polynucleotide 
exhibits the similar properties as the protein encoded by the 
10 basic sequence. The term therefore includes alleleic variants, 
degenerate variants which encode similar proteins but vary only 
as a result of the degeneracy of the genetic code. It also 
includes a polynucleotide which substantially hybridises to the 
polynucleotide sequence of the present invention. Preferably, 
15 such hybridisation occurs at, or between low and high stringency 
conditions. In general terms, low stringency conditions can be 
defined as 3 x SSC at about ambient temperature to about 55°C and 
high stringency condition as 0 . 1 x SSC at about 65°C. SSC is the 
name of the buffer of 0 . 15M NaCl, 0.015M tri-sodium citrate. 3 
20 x SSC is three times as strong as SSC and so on. 

Typically, variants have 65% or more of the nucleotides in 
common with the polynucleotide sequence of the present 
invention, more typically 70%, preferably 75%, even more 
preferably 80% or 85% and, especially preferred are 90%, 95%, 
25 98% or 99% or more identity. 

Variants may comprise the basic sequence which has been 
modified to ensure that the codon usage is enhanced or 
optimised, as would be understood in the art, for a particular 
organism in which it is required that the sequence is expressed 
30 in a desired organism, for example a prokaryotic cell such as E. 
coli. This may involve modifying the content of particular 
nucleotides, for instance changing the percentage of G and C 
present in the sequence, to suit that usually found in genes 
which are highly expressed in the target organism. In addition, 
35 particular variants of SEQ ID NO 1 are synthetic variants, 

engineered to remove codons rarely found in highly expressed 
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genes from common expression hosts such as E. coli and, at the 
same time, avoid the introduction of codons rarely found in 
genes coding for O-antigens. For example, wherever possible the 
codons for the amino acids arg, leu, ile, gly and pro are 
5 changed to CGT or CGC (arg), CTG, CTT or CTC (leu), ATC or ATT 
(ile), GGT or GGC (gly), and CCG .CCA or CCT (pro), thus 
eliminating rare codons. 

When comparing nucleic acid sequences for the purposes of 
determining the degree of identity, programs such as BESTFIT and 
10 GAP (both from Wisconsin Genetics Computer Group (GCG) software 
package) . BESTFIT, for example, compares two sequences and 
produces an optimal alignment of the most similar segments. GAP 
enables sequences to be aligned along their whole length and 
fins the optimal alignment by inserting spaces in either 
15 sequence as appropriate. Suitably, in the context of the 
present invention when discussing identity of nucleic acid 
sequences, the comparison is made by alignment of the sequences 
along their whole length. 

SEQ ID NO 1 comprises a series of genes which encode a 
20 number of enzymes which are shown hereinafter in Figure 5 and 

SEQ ID NOs 3-17. Preferably any variants of SEQ ID NO 1 encode 
enzymes of SEQ ID NOS 3-17 or modifications of these. 

The expression "fragment" used in relation to amino acid 
sequences refers to any portion of the given amino acid sequence 
25 which has the same activity as the complete amino acid sequence. 
Fragments will suitably comprise at least 2 0 and preferably at 
least 50 consecutive amino acids from the basic sequence. 

The term "fragments" is also used in relation to nucleic 
acid sequences. Fragments of SEQ ID NO 1 may have applications 
30 in diagnostics, and these form a further aspect of the 

invention. For diagnostic purposes, fragments may be quite 
short, for example from 5-30 bases, which may be used as primers 
or probes. Particular characteristic regions of SEQ ID NO 1 
from which suitable fragments for diagnostic purposes may be 
35 identified are elucidated hereinafter. Fragments which are 
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useful in therapy would generally be expected to be longer, for 
example from 600-17,000 bases long. 

A region of genome of the F . tularensis strain Schu 24 
(subspecies tularensis) which includes SEQ ID NO 1, and which is 
5 responsible for expression of the set of enzymes necessary for 
constructing the polysaccharide, has been identified- It is 
shown hereinafter in Figure 6 as SEQ ID NO 41- This sequence 
includes a number of genes including a series of genes that 
encode the enzymes illustrated in Figure 5 hereinafter as SEQ ID 
10 NOS 3-20. Putative functions were applied to these genes by 
comparison with known sequences as illustrated in Table 1. 



Table 1 



SEQ ID 


F. tularensis 


Gene 


Putative function 


NO 


protein 


product 








size (aa) 




2 


Transposase 


247 


Hypothetical protein 








Transposase 


3 


WbtA 


578 


Sugar epimerase 


4 


WbtB 


205 


Galactosyl transferase 








Glycosyl transferase 


5 


WbtC 


263 


UDP-glucose 4 -epimerase 


6 


WbtD 


363 


Sugar transferase 


7 


WbtE 


436 


LPS biosynthesis 








Dehydrogenase 


8 


WbtF 


323 


C 4-epimerase 


9 


Wzy 


409 


Membrane protein / O-antigen 








polymerase 


10 


WbtG 


366 


Transferase 


11 


WbtH 


628 


Asparagine synthetase 


12 


Wbtl 


360 


Sugar transaminase / 








perosamine synthetase 


13 


WbtJ 


241 


Formyl transferase 


14 


Wzx 


495 


o-antigen f lippase 


15 


WbtK 


286 


Glycosyl transferase 
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SEQ ID 


F . tularensis 


Gene 


Putative function 


NO 


protein 


product 








size (aa) 




16 


WbtL 


294 


Glucose- 1-phosphate 








thymidylyltransf erase 


17 


WbtM 


348 


dTDP-D-glucose 4 , 6- 








dehydratase 








dTDP-D-glucose 4 , 6- 








dehydratase 


18 


Transposase 


126 


Transposase 


19 


ManC 


468 


Mannose-1 -phosphate 








guanylyl transferase 


20 


ManB 


494 


phosphomannomutase 



In particular the proteins illustrated as SEQ ID NOS 3-17 
are believed to be involved in O-antigen biosynthesis. The O- 
antigen itself has applications both in diagnostics and as a 
5 prophylactic or therapeutic vaccine. 

When the nucleic acids of the invention are expressed 
together in a host cell, they will result in the construction of 
an antigen that produces an immune response in an animal 
including a human, which is protective against infection by F. 

10 tularensis . Thus they may be used in the production of 
prophylactic or therapeutic vaccines. 

The nucleic acid may be included in a vector such as a live 
viral vaccine, for instance, adenovirus vector or vaccinia, or 
in a plasmid to form so-called "naked DNA" vaccines, or 

15 preferably in a bacterial vector such as attenuated Salmonella 
species. In this case, the nucleic acid will be under the 
control of suitable control elements such as promoters, signal 
sequences, enhancers and the like, as would be understood in the 
art. In this case, the nucleic acid is expressed either within 

20 the cells of the patient to whom the vaccine is administered, or 
in the case of bacterial vectors, within the host cell itself. 
As a result a series of enzymes are produced which are able to 
construct the protective O-antigen in situ. 
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The vector is suitably combined with a pharmaceutical^ 
acceptable carrier in a vaccine formulation. The nature of the 
carrier depends upon the type of vector being used, as would be 
understood in the art. In particular, when the vaccine 
5 comprises a recombinant Salmonella, it is suitably in the form 
of a composition which is suitable for oral administration. 

Alternatively, the nucleic acid may be included in an 
expression vector which is used to transform a host cell. 
Suitable host cells are prokaryotic or eukaryotic cells, but are 

10 preferably prokaryotic cells such as E. cold. In particular, 
the nucleic acid used is a synthetic variant of SEQ ID NO 1, 
optimised for expression in the particular host cells. The 
protective O-antigen can then be recovered from these cells 
after culture thereof. 

15 Thus in a further aspect there is provided a method of 

preparing a prophylactic or therapeutic vaccine, which method 
comprises transforming a host cell with a nucleic acid of the 
invention, culturing said host cell, and recovering a moiety 
which produces a protective immune response against F. 

20 tularensls therefrom. 

Expression vectors and host cells for use in this method, 
together with the product thereof form further aspects of the 
invention. 

Vaccines of this type will suitably be in the form of a 
25 pharmaceutical composition, in which the antigen is combined 
with a pharmaceutically acceptable carrier, as would be 
understood in the art. 

The compositions of the invention may be in a form suitable 
for oral use, for administration by inhalation (for example as a 
30 finely divided powder or a liquid aerosol), for administration 
by insufflation (for example as a finely divided powder) or for 
parenteral administration (for example as a sterile aqueous or 
oily solution for intravenous, subcutaneous, intramuscular or 
intramuscular dosing or as a suppository for rectal dosin. 
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The compositions of the invention may be obtained by 
conventional procedures using conventional pharmaceutical 
excipients, well known in the art. 

For further information on Formulation the reader is 
5 referred to Chapter 25.2 in Volume 5 of Comprehensive Medicinal 
Chemistry (Corwin Hansen; Chairman of Editorial Board), Pergamon 
Press 1990. 

The amount of active ingredient that is combined with one 
or more excipients to produce a single dosage form will 

10 necessarily vary depending upon the host treated and the 

particular route of administration. For example, a formulation 
intended for oral administration to humans will generally 
contain, for example, from 0 . 5 mg to 2 g of active agent 
compounded with an appropriate and convenient amount of 

15 excipients which may vary from about 5 to about 98 percent by 
weight of the total composition. Dosage unit forms will 
generally contain about 1 mg to about 500 mg of an active 
ingredient. For further information on Routes of Administration 
and Dosage Regimes the reader is referred to Chapter 25.3 in 

20 Volume 5 of Comprehensive Medicinal Chemistry (Corwin Hansch; 
Chairman of Editorial Board), Pergamon Press 1990. 

The size of the dose for therapeutic or prophylactic 
purposes of moiety of the invention will naturally vary 
according to the age and sex of the animal or patient and the 

25 route of administration, according to well known principles of 
medicine. 

Thus in a further aspect the invention provides recombinant 
O-antigen of F. tularensls which is obtainable from a host cell 
which expresses proteins of SEQ ID NO 3-17, or modifications 
30 thereof. 

Furthermore, the applicants' realisation of the sequence of 
the O-antigen sequence provides the possibility that this 
sequence can form the basis of diagnostic tests, to determine 
whether a patient has an F. tularensls infection. In such case, 
35 samples such as blood or saliva samples may be taken from the 
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patient and the presence of SEQ ID NO 1 or variants thereof 
detected . 

Specific detection methods are well known in the art, and 
may include amplification procedures such as the polymerase 
5 chain reaction (PCR) and/or other detection methods using for 
example labelled probes that hybridise to the target sequence 
and particularly SEQ ID NO 1. Primers and probes of this kind 
form a further aspect of the invention. 

By selection of particular primers and probes, it may also 
10 be possible to allow differentiation between strains of F. 

tularensls infection. For instance, the applicants have found 
that primers comprising SEQ ID NOS 21 and 22 and 35 and 36 set 
out hereinafter will allow distinction between strains of F . 
tularensls subspecies tularensls, and F. tularensls subspecies 
15 holarctlca as described below. In the former case, this 

possibility arises because of differences in the downstream 
sequence, and in the latter, because of differences in deletions 
in the flanking transposase sequence. Consequently, analysis 
using primers or probes based upon these regions may be used to 
20 determine whether- any particular strain is F . tularensls 
subspecies tularensls or otherwise. 

In order to discover whether the LPS from a subspecies 
tularensls strain has similar structure (and properties) to that 
from a subspecies holarctlca strain, LPS from F. tularensls 
25 strain Schu S4 (subspecies tularensls) was extracted. 

LPS extracted from F . tularensls strain Schu S4 was shown 
to have a characteristic ladder pattern after gel 
electrophoresis. However, the LPS was difficult to stain and 
required additional oxidation in order to visualise the O- 
30 antigen bands. This may suggest that the sugars in the O- 

antigen of F. tularensls strain Schu S4 are not oxidised in the 
same way as the O-antigen sugars found in most other bacteria. 

The F. tularensls strain Schu S4 O-antigen gene cluster 
contained 15 genes, the putative functions of which was assigned 
35 (see Table 1 above) based on the BLAST results and structural 

information about the sugars contained in the O-antigen . Genes 
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within the cluster are likely to be responsible for the 
production of the O-antigen molecule as well as the 
transportation of the molecule out of the bacterial cell. 

There are two main O-antigen synthesis modes, O-antigen 
5 polymerase (wzy) -dependent and wzy-independent. In the wzy- 
dependent system it is thought that the polymerase (wzy) , 
flippase (wzx) and chain length determinant (wzz) are part of a 
complex in the cell wall which facilitates polymerisation and 
export of the LPS molecule. In the wzy-independent system a 
10 different set of proteins are involved in the transportation and 
polymerisation of the LPS molecule. The transporter is ATP 
driven and composed of two proteins wzt and wzm that belong to 
the ABC-transporter family. 

In the F. tularensis O-antigen gene cluster, proteins with 
15 high identity to wzy and wzx are present, suggesting that 

transportation and polymerisation of the O-antigen is via a wzy- 
dependent pathway . 

The TMHMM analyses of the putative O-antigen flippase (Wzx) 
and polymerase (Wzy) proteins supported their assigned functions 
20 based on sequence similarity. The predicted numbers of trans- 
membrane helices for the F. tularensis proteins of 14 and 11 for 
Wzx and Wzy respectively are similar to those predicted for 
other bacteria, in which these cytoplasmic membrane proteins 
have been predicted to have around 10-12 trans-membrane helices. 
25 The prediction of 2 large periplasmic domains for the F. 
tularensis Wzy protein is consistent with the two large 
periplasmic domains of the Shigella flexneri Wzy protein. 

No gene that could encode a Wzz homologue was identified, 
which may indicate that one is not present in the F. tularensis 
30 genome. 

The close proximity or overlapping of the genes wbtA to 
wbtL suggests these are transcribed as one operon. 

Approximately 0.5Kb downstream is wbtM, which has a 
putative promoter of its own. Downstream of the second 
35 transposase are manC and manB, which also have their own 

putative promoter and are probably not involved in biosynthesis 
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of the O-antigen as mannose was not found to be part of the 
structure of the F. tularensis O-antigen, nor is it one of the 
intermediate products required for its synthesis. 

The two genes manC and manB may once have been involved in 
5 biosynthesis of the O-antigen in an ancestor of F. tularensis . 

The presence of transposases flanking the O-antigen biosynthetic 
gene cluster wbtA to wbtM suggests this cluster may have been 
horizontally acquired, perhaps replacing an ancestral 
polysaccharide gene cluster. 
10 The O-antigen gene cluster appears to be present in all 

subspecies tularensis and B strains screened. However, there is 
at least one difference between the clusters in subspecies 
tularensis and B strains within a region containing a 
transposase. BLAST analysis using the partially deleted 
15 transposase has revealed possibly over 50 copies of it in the F . 
tularensis Schu S4 genome. It is possible that the insertion 
sequence originated in the F. tularensis genome from 5. 
pneumonia and was . copied randomly within the genome. The open 
reading frames flanking the insertion sequence have no 
20 significant homology within the F. tularensis genome, suggesting 
that these genes were not imported to this locus with the 
insertion. 

In subspecies tularensis strains, this insertion has become 
deleted to leave only fragments of the transposase and 

25 downstream sequence. The overall similarity between the 

subspecies tularensis and subspecies holarctica clusters seems 
to indicate that the insertions took place in F. tularensis 
before division of the subspecies. Partial deletion of the 
subspecies tularensis transposase would have the effect of 

30 stabilising this region of DNA, as this enzyme has been found to 
be necessary for insertion events to take place. 

It seems unlikely that this will affect expression of the 
cluster in either subspecies tularensis or B strains. It could 
be that in subspecies tularensis strains part of the transposase 

35 has been lost due to genome down sizing. However, the gross 

difference in size of PCR products generated across this region 
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when amplifying DNA from different subspecies may be utilised in 
diagnostic procedures. 

The applicants have found that a similar O-antigen gene 
cluster to that found in F . tularensls strain Schu S4 is present 
5 in other strains of F. tularensls. This includes subspecies 

holarctlca strains. Consequently, a vaccine which utilises the 
O-antigen to produce a protective immune response is likely to 
provide protection against infection by several virulent strains 
of F. tularensls. 
10 The applicants have demonstrated that LPS from F. 

tularensls subspecies tularensls strains is protective. In 
particular, it appears to be protective against challenge from 
strains other than F. tularensls subspecies tularensls, and in 
particular against challenge with F. tularensls subspecies 
15 holartlca. This finding is unexpected in view of the results 
reported above which suggest that LPS from F. tularensls 
subspecies holartlca is not protective against infection by 
other F. tularensls species. Thus recombinant vaccines as 
described above will be particularly useful. 
2 0 Thus in a further aspect, the invention provides LPS 

obtainable from F. tularensls subspecies tularensls for use as a 
vaccine against infection by F. tularensls. Vaccine 
compositions containing LPS from F. tularensls subspecies 
tularensls are also novel and form a further aspect of the 
25 invention. These will comprise pharmaceutically acceptable 
carriers as described above. 

The invention will now be particularly described by way of 
example with reference to the accompanying diagrammatic drawings 
in which: 

30 

Figure 1. SDS-PAGE analysis of LPS isolated from E. coll strain 
K325, 1.25 \iq (track 1) and F. tularensls strain Schu S4, 5 0 
(track 2 ) . 
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Figure 2. The genetic organisation of the O-antigen gene cluster 
in F. tularensis strain Schu S4 . The G+C content of the O- 
antigen cluster is shown in the upper panel. 

5 Figure 3. Schematic structure of an O-antigen subunit of F. 
tularensis strain Schu S4 and the assignment of putative 
functions to the O-antigen gene cluster genes. A single O-unit 
is shown with sugar residues and glycosidic linkages indicated. 

10 Figure 4. Shows the region of the genome the nucleic acid 
sequence of the F. tularensis genome which encodes all the 
proteins shown in Figure 5 . 

Figure 5 . Shows the amino acid sequences of proteins encoded by 
15 SEQ ID NO 1, as well as a number of flanking gene sequences, 

Figure 6. Shows the nucleic acid sequence (SEQ ID NO 1) which 
encodes the enzymes necessary for O-antigen production. 

2 0 Example 1 
Methods 

Bacterial strains and growth conditions 

Bacterial strains used in this study are shown in Table 2 
and were cultured at 37°C on BCGA agar for 48 hrs . 
25 Table 2 



Species and Strain Subspecies 



F. 


tularensis 


Schu4 


tularensis 


F. 


tularensis 


199 


tularensis 


F. 


tularensis 


230 


tularensis 


F. 


tularensis 


041 


tularensis 


F. 


tularensis 


LVS 


holarctica 


F. 


tularensis 


200 


holarctica 


F. 


tularensis 


025 


holarctica 


F. 


tularensis 


075 


holarctica 


F. 


tularensis 


HN63 


holarctica 


F. 


tularensis 


147 


mediaasiatica 
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LPS purification 

LPS was purified from F. tularensis strain Schu S4 using a 
hot-phenol and water extraction method (Westphal O, et al . 
(1965) . Methods in Carbohydrate Chemistry 5: 83-91) . 
5 Gel electrophoresis and silver staining 

Glycine gel electrophoresis was performed according to the 
method of Laemmli (Laemmli UK. (1970). Nature 221 \ 680-685.) 
using a 12.5 % separating gel with a 4.5% stacking gel. Ten yl 
of each sample were electrophoresed for approx 2 h at 100 mV in 
10 the Mini-protean II slab system (Biorad) . 

Gels were silver stained according to the method of Chart (Chart 
H. (1994) LPS: Isolation and Characterisation. In: Raton B, 
Arbor A (eds.) Methods in Practical Laboratory Bacteriology. CRC 
Press, London, Tokyo, pp. 11-20). However, the oxidation step was 
15 increased to 10 min. 

Nucleotide sequence analysis 

The sequence encoding the O-antigen biosynthetic cluster 
was identified and extracted from the Known protein sequences 
(obtained from GenBank) involved in the biosynthesis of the 0- 
20 antigen of other bacteria were used to probe the F . tularensis 

Schu S4 partial genome sequence (Prior RG, et al. (2001) Journal 
of applied microbiology 91: 614-620), available at 
http : //artedi ■ ebc . uu . se/Pro j ects/Francisella/ , using TBLASTN 
(Altschul SF. et al. (1997) Nucleic acids research 25: 3389- 
25 34 02). The contig containing the putative O-antigen gene 
cluster was extracted and subsequently analysed using the 
annotation tool Artemis 

(http://www.sanger.ac.uk/Software/Artemis) . This allowed 
visualization of BLASTN, BLASTX and BLAST P searches, GC content 
30 and other analyses performed on the sequence and the predicted 
proteins . 

The protein sequences encoded by the putative O-antigen 
flippase gene (wzx) and O-antigen polymerase gene (wzy) were 
analysed for trans -membrane helices using TMHMM (Sonnhammer ELL, 
35 et al. (1998) In: Glasgow J, Littlejohn T, et al. (eds.) The 
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sixth international conference on intelligent systems for 
molecular hiology. AAAI Press, Menlo Park, CA, pp. 175-182). 
PCR analysis of the putative O- antigen gene cluster 

DNA was prepared from the F. tularensis strains shown in 
5 table 1, by phenol extraction, as described by Karlsson et al 
2000 (Microti. Comp. Genom. 5: 25-39). Ten pairs of overlapping 
PCR primers were designed to amplify the whole of the putative 
O-antigen gene cluster in approximate 2 kilobase segments using 
the DNAstar program PrimerSelect . The primers were designed 
10 with annealing temperatures ranging from 42 to 5 9°C, although all 
were used successfully at 49°C. 

The structures of these primer pairs is summarised in Table 

3. 



Table 3 



Primer 
set 


Forward/ 
reverse 


Structure 


SEQ ID 
NO 


1 


Forward 
Reverse 


AT AAT G AAA 1 CA/i 1 OOAOCjACj 
CCAGCCAGTCAGTCCCACAG 


22 


2 


Forward 
Reverse 


TGTCTTAGATATGGGGCAACC 
ACAAATATCAAATCCTAACACATC 


23 
24 


3 


Forward 
Reverse 


TAGAAGCAGCTGCGATAGGTAGAC 
TTAAATAAAAACTGAGGAAACA 


25 
26 


4 


Forward 
Reverse 


AT GGT AT T TTAATCAAGT GT 
CTAGTATGCCCCAGAGT 


27 
28 


5 


Forward 
Reverse 


T G GT GCGAC AAT C AAGT T A 
AGAAGTTCCTCCTCAGTC 


29 
30 


6 


Forward 
Reverse 


AGAAAT T AAG AGCAAAAGGAAAGT 
AT CT CAAAGT CAAAAT CAGT CT CT 


31 
32 


7 


Forward 
Reverse 


TACGATATTGTCCTCTCCGATTAG 
TAGTTGCGACATATTGACCTG 


33 
34 


8 


Forward 
Reverse 


AG GC AG GT C AAT AT GT C G C AAC T 
TTTCCGCAACACTTCAGCAACTT 


35 
36 


9 


Forward 
Reverse 


GCTATGGCCACTATCACGAGAGG 
TATACTTGCTTGCCCACTGCTTAG 


37 
38 


10 


Forward 


ACCGTAGTGAGCATTGGATTGT 


39 
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Primer 
set 


Forward/ 
reverse 


Structure 


SEQ ID 
NO 




Reverse 


ACTAGGGCCTCTGACCGTTCTC 


40 



PCR amplification using each pair of primers with each 
template DNA was carried out in the following mixture: lx PCR 
buffer (including 1 . 5 mM MgCl 2 ) , 0 . 2 mM deoxynucleoside 

5 triphosphates (dNTPs), 2 . 5 mM forward primer, 2 . 5 mM reverse 
primer, 2.0 ul template DNA, 0 . 5U Tag polymerase and filtered 
sterile water to a final volume of 20 ul. The reaction mixtures 
were incubated at 90°C for 1 min and then cycled at 90°C for 1 
min, 4 9°C for 1 min and 72°C for 2 min 25 s for 30 cycles, with a 

10 final incubation at 72°C for 10 min. PCR products were 

visualised on 0.5% agarose gels, with ethidium bromide staining. 
PCR buffer, dNTPs, and polymerase were from Roche. PCR primers 
were synthesised by MWG-Biotech. 
Cloning of PCR Products 

15 PCR products amplified from Schu S4, HN63 and LVS DNA using 

primer pair 8 were cloned into pGEM-T easy (Promega) for 
sequence analysis. Ligated DNA was transformed in E. cold JM10 9 
chemically competent cells (Promega) and putative clones were 
screened using both colony PCR and digestion with restriction 

20 endonucleases . All DNA manipulations, including ligations, 

transformations, colony PCR, restriction endonuclease digestions 
and agarose gel electrophoresis were carried out according to 
methods described by Sambrook et al (1987) Molecular cloning: A 
laboratory manual. Cold Spring Harbor, New York). 
. 25 Purification of PCR products from agarose gel was achieved 

using the QIAquick Gel Extraction Kit (Qiagen) according to the 
manufacturer' s instructions . 

The three constructs were sequenced at Oswel by the 
dideoxynucleotide chain-termination method (Sanger F, et al . 

30 Proc.Natl.Acad.Sci.U.S.A. 74: 5463-5467) using universal 

primers. Each sequence was compared and the BLAST (Altschul SF, 
et al. (1990) Basic local alignment search tool. J.Mol. Biol . 
215: 403-410) function of the ARTEMIS software package was used 
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for homology searches in the locally held GenBank databases to 
identify the functions of the differential regions of DNA. 
Mass spectrometry analysis of the O- antigen molecule 
Results 
5 LPS purification 

The hot phenol-water extraction method was used to purify 
LPS from 2.2 g of freeze dried F. tularensls strain Schu S4 . 
This resulted in 7 mg of LPS, which is a yield of 0.3 %. The 
LPS was difficult to visualise after SDS-PAGE and silver 
10 staining. The oxidation step was increased from 5 min to 10 min 
to visualise a ladder pattern (Fig 1) - 
F. tularensls O- antigen biosynthetic gene cluster 

The F. tularensls O-antigen biosynthetic gene cluster was 
found to be 17Kb in length and contain 15 genes putatively 
15 identified as being involved in O-antigen biosynthesis, flanked 
by two transposases (Fig 2) . Possible promoter sites were 
identified just upstream of the genes wbtA and wbtM. Downstream 
of the second transposase are located the genes manC and manB, 
with a possible promoter just upstream of manC. 
20 Figure 2 also shows the G + C content plot of the cluster 

using a window size of 500 bases. The overall G + C content of 
this region of the genome at 31.27% is slightly lower than the 
genome average of approximately 33%. The plot shows that the 
central section of the cluster, from wzy to wbtK, generally has 
2 5 an even lower G + C content. 

Downstream from manC, on the opposite strand, are located 
the genes for the transcription termination factor rho and 
thioredoxin. In E. coll both of these genes are also found 
flanking one end of a polysaccharide biosynthetic gene cluster - 
30 that of the enterobacterial common antigen. 

The O-antigen repeat unit of F. tularensls is shown in Fig 
3, together with the putative role of the genes involved in O- 
antigen biosynthesis. Based on their homology to other LPS and 
sugar biosynthetic genes, in particular P. aeruginosa serotype 
35 06 which expresses a similar O-antigen repeat structure (Knirel 
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YA, et al. (1985) Eur . J Biochem. 150: 541-550), the putative 
role of the gene products have been assigned. 

It is proposed that the biosynthesis of 2-acetamido-2 , 6- 
dideoxy-D-glucose (QuiNAc) involves WbtA, a dehydratase and 
5 WbtC, which shows homology to UDP-Glc 4-epimerases . WbtA and 

WbtC share homology to WbpM and WbpV of P. aeruginosa strain 06, 
both thought to be involved in QuiNAc biosynthesis and shown to 
be essential for 06 0-antigen synthesis. WbtE, WbtF and WbtH 
are proposed to be involved in 2-acetamido-2-deoxy-D- 
10 galactouronamide (GalNAcAN) biosynthesis. WbtF shows homology to 
UDP-glucose 4-epimerases, including WbpP and VipB, whilst WbtE 
shows homology to WbpO and VipA, UDP-GalNAc dehydrogenases 
involved in the formation of 2-acetamido-2-deoxy-D-galactouronic 
acid (GalNAcA) in P. aeruginosa and Salmonella enter! ca var 
15 typhi respectively. WbtH produces significant alignments with 
glutamine amidotransf erases , including WbpS of P. aeruginosa 
serotype 06, which may putatively be involved in the formation 
of the GalNAcAN amido group. Biosynthesis of the fourth sugar, 
4N-f ormyl-quinovosamine (Qui4NFm) most likely involves Wbtl, 
2 0 WbtJ, WbtL and WbtM. Sequence homology suggests that WbtL may 
be involved in the formation of the activated sugar dTDP-D- 
Glucose with WbtM functioning as a dTDP-D-Glucose 4,6- 
dehydratase. Wbtl is proposed to be involved in Qui4NFm 
amination since it shows homology to RfbE, a perosamine 
25 synthetase. Finally, WbtJ is likely to be responsible for the 

addition of the N-formyl moiety, showing significant homology to 
formyltransf erases . 

Specific glycosyltransf erases are required to form the 
oligosaccharide units of the O-antigen repeat. Four 
30 glycosyltransf erases would be necessary for the synthesis of 

each O-antigen unit in F. tularensis. Based on homology, WbtB 
is proposed to mediate the addition of QuiNAc to undecaprenyl 
phosphate (Und-P) to initiate O-antigen biosynthesis. WbtD and 
WbtG are probable GalNAcAN transferases, possibly involved in 
35 the addition of the two consecutive GalNAcAN residues onto the 
O-antigen unit. WbtD shares homology to WbpU of P. aeruginosa 
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strain 06, proposed to transfer 2-f ormamido-2-deoxy-D- 
galactouronamide (GalNFmAN) onto QuiNAc (Belanger M, et al. 
(1999) . Microbiology 145: 3505-3521) . WbtG is homologous to WbpT 
of P. aeruginosa, thought to be involved in addition of GalNAcA 
5 to GalNFmAN. WbtK is probably the fourth glycosyltransf erase , 
which adds 4 , 6-dideoxy-4-f ormamido-D-glucose (QuiNA4Fm) to 
complete the tetrasaccharide O unit. 
Wzx and Wzy 

Once assembled, the O-antigen repeat units are translocated 
10 to the periplasmic face of the inner membrane via Wzx, a 

transporter/f lippase . Wzy then acts as an O-antigen repeat unit 
polymerase. When analysed using TMHMM, the F. tularensis Wzx 
protein had a predicted 14 trans-membrane helices, with both 
termini on the cytoplasmic side of the membrane. The F . 

15 tularensis Wzy protein had a predicted 11 trans -membrane 
helices, with the . amino terminus predicted to be on the 
cytoplasmic side of the membrane, and the carboxy terminus on 
the periplasmic side. Additionally, the Wzy protein was 
predicted to have two large periplasmic domains from amino acids 

20 142-178 and 268-327. 

A gene with homology to the O-antigen chain length 
determinant (wzz) was not identified in the current F . 
tularensis Schu S4 sequence dataset. 
PCR analysis of the O-antigen gene cluster 

25 Eight of the PCR products (primer sets 2,3,4,5,6,7,9 and 

10) from each template DNA appeared to bethe same size when 
viewed by agarose gel electrophoresis. Primer pair 1, covering 
the start of the gene cluster, had to be designed to amplify a 
4.8 Kb region due to lack of suitable priming sites upstream of 

30 the cluster because of the presence of an insertion element 
found many times in the F. tularensis Schu S4 genome. This 
primer pair 1 produced the relevant size product for F. 
tularensis Schu S4, but when used on subspecies holarctica, 
strain LVS, did not produce a product. Thus this primer pair 

35 may have particular applications in diagnostics where 

distinction between F. tularensis subspecies tularensis and F. 
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tularensis subspecies holarctica is required. Where samples 
containing DNA from the former is present, a PCR using primer 
pair 1 will generate a product, which would not be present in 
the second case. 

5 The PCR using primer pair 8 revealed a difference in size 

between subspecies tularensis strains and subspecies holarctica 
and subspecies mediaasiatica . Subspecies tularensis strains 
show a deletion of 303 nucleotides when compared to subspecies 
holarctica strains (including LVS) and subspecies mediaasiati ca . 

10 Cloning and sequence analysis of this region from the subspecies 
tularensis strain Schu S4, the subspecies holarctica strain HN63 
and LVS has shown that the deletion in Schu S4 occurs at the 
beginning of a putative transposase that is similar to IS630-spn 
1 transposase ORF 1 of Streptococcus pneumoniae. 

15 Thus primer pair 8 may also be particularly useful in 

distinguishing between strains of F. tularenis. Following a PCR 
reaction on samples containing DNA using these primers, a 
separation of the. products on the basis of size, for example on 
a gel, should reveal distinguishable differences therebetween. 

20 

Example 2 

Protective Effects 
LPS purification 

LPS was purified from F. tularensis strain Schu S4 or from 
25 strain LVS using a hot-phenol and water extraction method 
mentioned above in Example 1 . 

Immunization with LPS and protection studies 

The ability of F. tularensis strain LVS or strain Schu S4 LPS to 
protect BALB/c mice from a F. tularensis was determined by 
30 immunizing groups of six female BALB/c mice by the i.p. route 
with the purified LPS obtained. On each dosing occasion, mice 
were given 50 \iq of LPS in phosphate buffered saline (PBS) . The 
mice received three immunizations, each 7 days apart. 

35 Mice were challenged i.p. with F. tularensis LVS (1 x 10 5 CFU) 21 
days after the last immunization. All control animals died after 
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challenge. Mice which had been immunised LPS isolated from the 
LVS strain were protected from death. Mice which had been 
immunised with LPS from either the SchuS4 or LVS strain showed 
and extended time to death. At a challenge dose of 10 cfu 
5 animals immunised with SchS4 LPS survived for an average of 64 
hours (with 99 % confidence) longer than the unimmunised 
controls . 



WO 03/102191 



23 



PCT/GB03/02338 



Claims 

1. A nucleic acid which encodes a series of enzymes or enzyme 
fragments which, when expressed together in a cell, are able to 

5 produce an immunogenic moiety able to produce an immune response 
in an animal to which it is administered, which response is 
protective against Franclsella tularensls infection, said 
nucleic acid encoding at least some enzymes of SEQ ID NOS 3-17 
or modifications of these. 

2. A nucleic acid according to claim 1 which encodes enzymes 
of SEQ ID NOS 3-17 . 

3. A nucleic acid according to claim 1 or claim 2 which 
15 comprises SEQ ID NO 1 or a variant thereof. 

4. An nucleic acid according to claim 3 which is of SEQ ID NO 
1 . 

20 5. A nucleic acid according to any one of the preceding claims 
wherein the codons have been optimised for expression in a 
bacterial cell . 

6. A nucleic acid according to claim 5 wherein the bacterial 
25 cell is E. cold. 

7. A nucleic acid comprising a fragment of SEQ ID NO 1 which 
may be used to detect the presence of SEQ ID NO 1 in a sample. 

30 8. A nucleic acid according to claim 7 which comprises an 
amplification primer. 

9. A nucleic acid according to claim 8 which is selected from 
SEQ ID NO 21, 22, 35 or 36. 

35 
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10. A live vaccine vector, which comprises a nucleic acid 
according to any one of claims 1 to 5 . 

11. A live vaccine vector according to claim 10 which comprises 
5 a bacterial vector. 

12. A live vaccine vector according to claim 11 wherein the 
bacteria is a Salmonella species. 

10 13. A vaccine comprising a live vaccine vector according to any 
one of claims 10 to 12 in combination with a pharmaceutically 
acceptable carrier . 

14. A method of preparing a prophylactic or therapeutic 
15 vaccine, which method comprises transforming a host cell with a 
nucleic acid according to any one of claims 1 to 6, culturing 
said host cell and recovering a protective immunogenic moiety 
from the culture. 

20 15. An expression vector comprising a nucleic acid according 
to any one of claims 1 to 6 . 

16. A host cell transformed with a vector according to claim 
15 . 

25 

17. Recombinant O-antigen of F . tularensls obtainable by a 
process according to claim 14. 

18. A vaccine comprising recombinant O-antigen according to 
30 claim 17 in combination with a pharmaceutically acceptable 

carrier. 

19. A method of diagnosing infection by F. tularensls 
infection, which method comprises detecting in a sample taken 

35 from a patient suspected of having an infection a nucleic acid 
sequence according to claim 7. 
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20. A method of differentiating between strains of F. 
tularensis, which method comprises selecting primers or probes 
which are specific for SEQ ID NO 1, and not for similar 
sequences in subspecies other than F. tularensis subspecies 

5 tularensis, or which produce distinguishable products when used 
to analyse other species, and conducting an analysis using the 
said primers or probes . 

21. A method according to claim 20 wherein the analysis is 

10 conducted using a polymerase chain reaction (PCR) and a pair of 
primers . 

22. A method according to claim 21 wherein the primers are 
specific for a start region of SEQ ID NO 1. 

15 

23. A method according to claim 22 wherein the primers are of 
SEQ ID NO 21 and SEQ ID NO 22. 

24. A method according to claim 23 wherein the primers are 
20 specific for the end transposase coding region of SEQ ID NO 1. 

25. A method according to claim 24 wherein the primers are of 
SEQ ID NO 35 and SEQ ID NO 36. 

25 26. Lipopolysaccharide (LPS) obtainable from F. tularensis 

subspecies tularensis for use as a vaccine against infection by 
F. tularensis . 

27. LPS according to claim 2 6 where the strain of F. tularensis 
30 subspecies tularensis is the Schu4 strain. 

28. A pharmaceutical composition comprising LPS according to 
claim 26 or claim 27 in combination with a pharmaceut ically 
acceptable carrier . 

35 
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Fig.4. 

SEQ ID NO 41 

1 tcttttataa atgatgatag caaacaaaaa ataataggtt ctgtgcacaa aaaacttaaa 
61 ttatattgaa aatagctaaa ccgctgttat ttaagatttg aaaagcaata aatatcaatg 
121 gtttagcaaa tgaattatca tataaaagaa gtattctggt caattatttt atcattctta 
181 aaatcacaaa aaggtataca taccaatgat gaagccaaat taagattgtt tattgaagct 
241 gtattttatg tgttacgtac aggctgtcaa tggagaatgt. taccatttta ttatggtaaa 
301 tatagatcaa tacataagcg ttttaaagat tggtgtgata aagatatatt ttctagatta 
361 tttaaatcag tacaaaaccc tgatttacaa gaagtcatgc ttgattcaac aatagcaaga 
421 gcacatgctt gtgctacggg atatgataaa gatgataacc aagcaattgg tagatcagtt 
481 ggtaggataa ccactaaaat ccatgctatg actgatgctt taggtaatcc aatagaaata 
541 ttgttgtcag aggataaaac tcatgatagt aaagtagcta tagatttact aaaaaatgta 
601 tataatacaa aagttatcgc tgatagagca tatcattcta atgaaatcag gcagcatatt 
661 caaggtatat cctctgaagc tgttatccct tgtaaatcaa atactctaaa ccatatacct 
721 tttgatagtc atgtatataa agaaagacat ttgatagaga atttcttttc taaaattaag 
781 cattttagaa gagtattctc tagatttgat aaaaccattt tagcatatat aggaatgatt 
841 aaattagctt gtacttttat ttggttacga tgaatattta tttttgtgca cagaacctaa 
901 tttgcatttt tgtgcacaaa gaaaattttt ttgatataat agactttaat aggatatttt 
961 ctaaaaatta acaaatgtct ttctacgata atagaacgct taatttcgtg gtaataatag 
1021 ttttaactat tattactgtt aattggactt tctatatttt caagcaagat gttaatttac 
1081 attttttact tgcattagtt ttgctgagat gcttgtcatc ttttttacta cttagagatt 
1141 atatggctag ttggcgtaag tcgactcaaa aaactttttt acgtaaggct tttattaatt 
1201 tgccagtatt tttcatagtg gcattatttt tttatggcaa agtcactttt tcgttgatat 
12 61 tctctgagtt tttattttat gtttttttga tcagtttaag tgtctacttt tattggtatt 
1321 tgatgaacag aggatcagtg gataaaagta aaactgcggt tatttatggt gcaggtgctg 
1381 caggaacaaa gattgctcaa gaacttgctt ctgctggtta tcgcatcaaa tgttttgttg 
1441 atgacaatga aactttacaa aaaagaagta ttgatagtaa aaaggttcta tctaaagctg 
1501 aattaacaaa actattgcta tctagtagat ttgacctttt ggttattgca ttgccaagaa 
1561 atgcaaacca agtagtcaaa aatatatata aagaatttga aaaggatttt aatcagatta 
1621 gaattatgcc gcctcttgag gaaattcttc aagatgagaa ttttatgtca cagttgaagc 
1681 ctgtttcact ctatgatcta ttagcgcgtg atactaagag tttagataaa gaatctatct 
1741 ctaattttat caaaaataag gtggtgctag tcacaggagc tggaggtagt ataggttctg 
1801 aaatagtaca tcaatgtatc aagtatcagg caaaagagtt gatattggtt gatcatagtg 
1861 agtttaactt atataaaatt actgaggagt gtagtcattt taatatcaat agtgtgctat 
1921 gttctgtttg tgatagaaaa gcattggctg aggtttttca aaagtatact ccaaatatag 
1981 tatttcatgc tgctgcctac aagcatgttc ccttagttga ggagaatatc tctagagcaa 
2041 ttagaaataa tatcttaggt actaagaatg ctatagatct ggctatagaa gctggtgttg 
2101 agtcatttat attgatttcc actgataaag cagtgcgacc aacgaatgtt atgggggcta 
2161 ccaagagagt. ttgtgagctg tatttacaga atgttgatcc caaaaatacc aagcttgctg 
2221 cagtgcgttt tggtaatgtg cttggtagta gtggcagtgt gattccaaaa tttgaagagc 
22 81 aaataagaaa aggtggtcct gttacagtta ctcatcctga aattacacgt tattttatgt 
2341 tgataccaga agcttgtgaa ctggtcctac aagctggtgc tattgcaaaa aattcagagg 
2401 tctttgtctt agatatgggg caacctgtca agattattga tcttgctaaa caatttatta 
2461 gactttctgg tagaggtgat attgatatta aaatagttgg tttgcgtcca ggagagaaac 
2521 tttacgaaga gcttttgata gaggaagatg atgttagtac cgactataaa gatattttta 
2581 ttggtagaag gactttttac gatattaata ctctaaacca agatattgaa tcgttgatca 

2 641 aggatgatgt tgatcagctt gtgatattaa agaaaattgt tccggaattt gaacatagat 
2701 tgaatgggta gtggttttat gttttatgag gtttttaaaa gattgcttga tattttactt 
2761 tcttttatgg ggttgttgtt attaagtcct attttcttaa ttattatttt tatgataaag 
2821 aaagattcaa aaggacctat attttttaaa caaaagcgct atggtaaaga taagcaattt 
2881 ttttacatat ataagtttag aactatgtat gttgatactc caaaagatat gccaacgcac 
2941 atgttacagg atccatcgaa atgtataact aaggttggag gatttttaag gaaatcatct 
3001 ttagatgagt tgccacaaat tataaatatt ctaaaaggtg aaatgagcat cgtgggtcca 
3061 agaccagcat tatggaatca agatgactta atagcacaaa gagataagta tggggcaaat 
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Fig.4 (Cont I). 

3121 gctgtgcctg tgggactgac tggctgggca cagattaatg gtagggatga attaccaata 

3181 cctgataaag ctaaacttga tggtgattat gtaaaaaata aaagtacatg gtttgattta 

3241 aaatgtattt ttttgacagt attttctgtt tttgccaaaa agggcgtcgt tgagggtggt 

3301 actggagctt taggtaacaa agaggattta aagtagtatg aaaaaaagaa tcttagttac 

3361 aggtttgagt agctatattg gtaactcatt tgcggctaaa tataactcag attttagtat 

3421 cgataaaata tctttgcgcg atgtttcgtg ggcaaatata gacttaagtg gttatgatgc 

3481 tgtattgcat gtcgctggaa ttgcccatac ttcaaaggat cctaaactaa aagaaaaata 

3541 ctataaaata aatacgcaat taacttatga tctggcaaaa caagctaaag atcaaggtgt 

3601 tcgacagttt gtgtttttaa gtagtattat agtttatggt gatagtgcgc caataggtca 

3661 acaaaaagtt ataactaaat ataccgaacc taaaccagat gatttttatg gagatagtaa 

3721 gcttcaaact gaaattaagc taaatagcct ggctagtgat gactttaata ttgctataat 

3781 cagaccacca atggtatatg gagaaggctc aaaaggcaac tatccaaagt tggttaaact: 

38 41 tgcaaagtat acttttattt ttcctaatat taataaccaa agaagtgtta tatctataga 

39 01 taatttatct aaagagattg cagaaataat tttgcaaact aaacatggag tttttctact 
3961 tcaagataat gaatattttt gcacttcaca gtttataaaa aactatagaa aagatgtttt 
4021 aggtaagaga acttatctga caaaaatttt taatccaatt ataagattgc ttgctaaaaa 
4081 agtagatttt attaataaag tttttgggaa tttgacttat gagaagtaag ttattattca 
4141 tagctaatga ttttgatatt gtaatatatc gtttcagaag agaagtaatc gagtcttttg 
4201 ctgctaaaga gtatgagata gtactagtaa caccatattc taagaaagca gaggtttttt 
42 61 gtaaaagtct tggtgttaag tatataaatg ttgatataga tagacgaggc aaaaatcctt 
4321 ttaaggattt gcttctttta tttaactatt tcaaaataat aaaaaaagaa aaacctgatt 
4381 acatttttag ctatacaatt aaaccaaatt tgtatgttgg gttagtgaat ttgtttttta 
4441 ggaagaagtt ttatccaaat gtaacaggct taggaagtgt ttttgctaat catggtattg 
4501 ttcagaagtt tataatatct ttatataagt tatcatttaa aagcaccaca aaagtattct 
4561 ttcagaatga gcaaaataaa aagttattta tagctaagaa aataatcagt ggagaaaaat 
4621 caatattatt accaggttct ggggtaaact tagatgaaaa taaatatgtt gactatccta 
4681 aagaccaagg aatattaaaa ttcgtttttc ttggccgaat aatgaaagaa aaggggattt 

47 41 atgaattgtt agaagccttt gctatacttg agaaaaaata taaaaatatt agtcttgaca 
4801 tttatggttt ttgtgatgaa aataaatcta attttatggg aaaggttaat acgataaaat 

48 61 cagtaaaatt ttatggtttt actgataata ctaaagaaaa aatagctagt gcacatgcag 
4921 ttgttttgcc atcttaccat gaaggaatgt caaatgtgct gttagaagca gctgcgatag 
4981 gtagacctgt aattgcgtca gatattcctg ggtgtagaga aatttttgat gatggtctct 
5041 ctggcttatc atgtaaccct aatgatgtga gttctttacg taactcatta gagcagttta 
5101 taaatatgtc gtatactgat aaaatagcta tgagctataa agctagagct aagatagaaa 
5161 aagattttga tagaagtatt gttgtcaatg catacttaca gcaaaattaa taataagggt 
5221 ttaaattatg agtttatatg aggatatagt cgctaaaaga gaaaaggttt cattggttgg 
5281 cttgggttat gttggtttac caatagctat tgcatttgca aaaaaaatag atgtgttagg 
5341 atttgatatt tgtgaaacaa aagttcaaca ttataaggat ggttttgatc caacaaaaga 
5401 agtaggagat gaggctgtca gaaatacgac aatgaaattt agttgtgatg aaacaagtct 
5461 taaagagtgt aaatttcata ttgttgcagt tcctacacca gttaaagcag ataaaactcc 
5521 tgatttgacg ccgattatta aggcaagtga gacggttggt aggaatcttg tcaaaggcgc 
5581 ttatgttgtg tttgaatcaa ctgtttatcc tggtgttaca gaagatgttt gcgtaccaat 
5641 acttgaaaaa gagtctggct tgaggtctgg tgaagatttc aaagttggtt actctcctga 
5701 gaggataaat cctggtgata aggttcatag gttagaaaca attatcaaag tagtatctgg 
5761 tatggatgaa gagtctttag atactatagc aaaagtttat gagctagtag tagacgcagg 
5821 agtttataga gctagtagta taaaagtggc tgaagctgct aaggttatag aaaactctca 
58 81 aagagatgtt aatatagctt ttgttaatga gttatcgata atatttaatc agatgggtat 
5941 tgatactcta gaggttttag cagcagctgc aactaaatgg aatttcttaa actttaagcc 
6001 tggtcttgtt ggtggacatt gtattggtgt tgacccatat tacctaacgt acaaggcagc 
6061 tgagcttgga tatcattctc aggtaatatt atctggtcgt aggataaatg atagtatggg 
6121 taaatttgta gttgagaatt tagtcaaaaa actgatatct gcagatatac ctgttaagcg 
6181 agctagagta gcaattttcg gctttacttt taaagaagac tgtcctgaca ctaggaatac 
6241 tcgagttata gatatggtaa aagagctcaa cgagtatggt atagagccat atattataga 
6301 tccggtagct gataaagaag aggctaaaca tgagtatgga cttgagtttg atgatctaag 
6361 taaaatggtc aatctagatg cgatcattat tgctgttagt cacgaacagt ttaaagatat 
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Fig.4 (Cont II). 

6421 aacaaagcaa cagtttgata ggctatatgc gcataattct agaaagatta tatttgacat 

6481 caaaggtagt ttagataaat ctgagtttga aaaagattat atttattgga gattgtagtg 

6541 gcttacgata atgttaaatt tcctcatggt tcgttttttt tggtgactgg aggtgcgggt 

6601 tttattggct ctaatttatg tgaagtttta cttagtaagg gttatagagt taggtgttta 

6661 gatgatctct caaatggtca ctatcacaat gttgagccgt ttttaactaa ttctaattat 

6721 gagtttataa aaggtgatat tagagattta gatacttgca tgaaagcttg tgaaggtatt 

67 81 gattatgttc tacatcaagc tgcttgggga agcgtaccaa gaagtattga gatgccatta 

68 41 gtgtatgaag atataaatgt taaaggtgca ttaaatatgc ttgaagcggc tagacaaaat 
6901 aacgttaaaa aatttgtcta tgcttctagt tcatcagtat atggtgatga gccaaattta 

69 61 cctaaaaaag aaggtagaga aggaaatgtt ttatcaccct atgcatttac aaagaaagct 
7021 aatgaagagt gggcgagact atacacaaag ttatatggtc tagatactta tggtctaaga 
7081 tattttaatg ttttcggtag aagacaagat cctaatggtg cgtatgcagc agttatacct 
7141 aaatttatca aacagttatt aaatgatgaa gcgccaacta taaatggaga tggtaaacag 
72 01 tcgagagatt ttacatatat agagaatgtt attgaggcaa atcttaaagc atgtttagca 
72 61 gatagtaagt atgccggaga gtcttttaat atagcttatg gaggtagaga gt.atct.tata 
7321 gatttgtact ataatctttg tgatgccttg ggtaaaaaaa tagagccaaa ttttggtcca 
7381 gatagagcgg gtgatattaa gcatagtaat gctgatattt cgaaggctag gaatatgctc 
74 41 ggatataatc cggaatatga ttttgaatta ggcataaagc atgctgttga gtggtattta 
7501 attaattaaa tggtatttta atcaagtgta cataaaaaaa gtgtctttta aaattttata 
7561 tttatattta ctagcttttt gtattatttt tagtttagaa tttaaatttg ctatattgaa 
7621 tattatagtt tatcttccgg cttgtatttt gggtttttta gctcttaaaa aactatttgt 
7681 cggaaatatt gttaagaaac aattagcttt cctttttttc tttttctttt tatcaatgat 
7741 ttatttaata atagtccaaa taatcttact tgatgcagca tcattgtttc ctcagttttt 
7801 atttaacatt ttgatcgcga taggtttttg taactttatt tttgtttcat atgataataa 
7861 tgaaaattat ttttttaata tgtctaaaat aatatttttt gttactttct tacaatctat 
7921 ttttgtattt ctttcaaggt attatatatt tttaaatgat tggatattct tttttttagt 
7 981 gaaaaaaggg aatattgaga tttcgaatgt tattgaatat aagttaagag tattcggact 
8041 tagtaacgct ggaggggatg gtttaggatt ttcaattact ataggattat gtttttctat 
8101 attttatttt atcaaatata ttaaaggtaa atctatattt accaaactta tgctgtttgt 
8161 acctttaatt cttattgtgt tttctaatat tttcatatct agaacatcac tcttaacttc 
8221 ttcacttata ttgttaataa caatatttta tatatatatt aaaaaagaaa aattactgtt 
82 81 tattataata ttggcgctat tctttttatc aatatggata ttgttcaaat taaatttgaa 
8341 tttgagttgg gcttttgaaa atatttactc gtacattcaa tctggcgatt tttcacatgg 
8401 aagtctaagt gttttaatca ataaaatgct ttttgtgcca gataaccttt tgacttggat 
8461 atttggttgt gaggatgtta gtaatactga tattggttat attaaatatt tatactatta 
8521 tgggattata tttagtatgt ttttttatat tcttattatt ttcttgtact ttgaaatgag 
8581 aaaatgtttt atattttcag agtatcgatc attatttcta ttgttgttaa tagtatgttt 
8641 agtttttcaa gcaaaaataa tttttttgac agtaggatta tttactaaat taaccattat 
8701 attatttatt ttttctctta aagaaaacag ctttacaact aggagtgtga tttgaaaagg 
8761 tttgtacatt taataataaa ccttaaccaa ggtggtgctg aaacaatgct ttataaactt 
8821 tgcaaatcta tggataagtc aatatatcat attacgatta tatcacttat gggtagggga 
8881 gtatttgcaa ataagttaga agcttatggt gttaaagttt atacattaaa tttaaataaa 
8941 tttaatgtac tatttgtatt gtttaaatat attaagatta tcagaagaat aaagcctgat 
9001 gttattcatg cttggatgta tcatgcaaat gtaatttcta tattatgcaa gcctttttat 
9061 agaaagacta aatatataaa tagtataaga atgggattgg agaattatga tggtcataag 
9121 aatcttacaa agtttatgat aaagttgaat gcaaaatttt ctaagttctc agatttaaca 
9181 ttaaataatt caaagaaatc attagaagat catcaaaata taggttttaa aaaccaatgc 
9241 tttatagcaa atggttttga taaagatgtt tttaaaccga gctttttaaa gtatgaaaaa 
9301 tttcgtttaa ataatgattt agatgataat gttaaaatta taggtatcat agcaagaaat 
9361 catgctgata aaaatatttc tcgtttctta caaatagcta atttattgtt aaaaagtaat 
9421 cctagtttac ggtttttaat tgctggaaga gagtgttcga aaatagatat aggtagttat 
9481 ctagataaca aaagtaatgt aaataagttt tttgtatttg aatctgtgga ttctagtgaa 
9541 tacttaccag tattagattt atatttgtct acatcaaaag ttgaaggttt tccaaatata 
9601 cttgcagaag ccatgctatg tgaagttcct attgttgctt ctaatgttgg agattgtaaa 
9661 gatatactta atggatacgg tgaagttttt gagcttagtc aaggtaataa agaaataata 
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Fig.4 (Cont III). 

9721 gaaaagatta tgaaagtttt agaaacaacg gtagtcatga aaaagcgcat gagagaatat 
9781 ataataaata attttagtat agaagctatt ttggaaaaac acgaaaaact ttatcatgag 
9841 ggcagtgtct aatgtgtgga gtagtaggct tttactcatt taataaagaa gaaggttttg 
9901 actcaataat taatcaatca ttgctttcta taaagcatag agggtcggat gatagtgggt 
9961 attggtgcga caatcaagtt actctggggc atactagatt atcaatacac gatataacta 
10021 atgcgggaca tcagccaatg ttatctaata gcggtaatac tgctattgtg tttaatggag 
10081 aaatatataa ttacttatcc ataaaaaatc agctattaag tgaatattca aatcttaaat 
10141 ttaaaagtaa cagtgatact gaggttttgg tcaatgctat tgaactttgg ggtatagata 
102 01 aaactttaga aaaatgcata ggaatgtttg cttttggagt ttacagtaga aaaactagtt 
102 61 gcttaatact agctagagat agatttggcg agaagccatt atattttggt atccaaaatg 
10321 gtattttggg ttttgcatca gaattgaagg cacttaagcc attaaaggaa tgtggctgga 
10381 ggtttgatat agatagagat gctttagcaa catatatgag gtatgcttat gtaccaacac 
10441 catactctat ttataaaaat atatctaaac taaatgtagg tagttacata aaatttgatg 
10501 ctaaaggtaa tagtaaagag tataaatatt gggattctaa aaaagtacta gattcagaaa 
10561 aatataaaga ttcgtatgat caagcaatcc tagatttaga aattaagctt aaaagtacac 
10621 tatcaataca aatgcagtca gatgttcctc taggagcatt tttatccgga ggaattgact 
10681 caacaactgt agttgctctt atgcaaagta tgtctaaaga taagataaac acttttagta 
10741 taggttttaa tcaaaaagaa tataatgaag ctgagcatgc aagagcagta gcaaaacata 
10801 taggtacaaa ccacacagat atgtatgtta cagaaagaga tgctcttgat gtaataccaa 
10861 aacttgctgg aatatatgac gagccctttg ctgattcatc acaaatacca acgtatcttg 
10921 tgagtaaaat agctaagtcg aaagtaacag ttgcactatc aggtgacgct ggtgatgagc 
10981 tctttggcgg ttataataga tactttttag caccaaatat tgctaaaaaa atcaaatttg 
11041 ctaagttact taaatatgca ccagatgctt ggataaaaaa agctgagata ttaaattttg 
11101 gtaagttcgc tttattagca gataaactac taaaactaaa aagagttctc gaaaaagcaa 
11161 aaacaaataa agagctttat gtactacttt gttcacaaat aaatgatact agctttgtgt 
11221 taggagcaaa agagtatgat atattaagag ataagaatat ttatgatatt ccacaattat 
11281 ctttccaaga gtggatgatg tttgttgatt ctaatacata tatgatagat gatatattgg 
11341 ttaaggttga tagagcagct atggctaact ctctagagac aagagtgcca tttttagatc 
11401 ataatattta tgaatttgct tattccttac caattgacta taaaatacaa cgaggtaacg 
11461 gaaaaagaat tttgaaagat ttgttatata aatatgtgcc agaaagtttg gtcaataggt 
11521 ctaagatggg gtttggtatt ccgcttgcta aatggttaag agaagattta cgagagtggg 
11581 cagataattt actggattat agtaaaatag acaagcaagg ttacttaagt cctgaggtgg 
11641 tgcaaaaata ttggcaagag catttgagtg gtaaaagaaa ttggcaagca atattatgga 
11701 atattctaat ttttcaggag tggttagata atgagtaaag taaatgtaac aaaaccatac 
11761 ttaccagata taaataaata taaaagctat gtaaataaaa tatacaaaaa tggatggctt 
11821 actaataatg gtccgttagt gcaagagcta gaaaaaagac ttgcaaagta tctaggtgtt 
11881 aaaaatatag ttttagtatc aaatggtaca attgcattag aaatcgcgta tagagcgtta 
11941 ggagtcaaag gaagtgcaat tactactcca ttttcatttg ttgctactac atcttcattg 
12 001 gtttctaaca atgtaaaacc agtgtttgtt gatattgatg agaatactct aagtatagac 
12061 gtctctaaaa ttaagtatgc tattgaagag gatacttcag ctattgtgcc agttcatgtg 
12121 tttggaaatg gttgtgaagt tgaaaaaata gacatgctgg ctaaaaaaca taacttaaaa 
12181 gttatttatg atgcagcaca tgcttttgat gttaagtata agggtgagag tatattaaac 
12241 tatggtgata tttcgacatt aagttttcat gcaacaaaga tttttcattc tattgaagga 
12301 ggtgcgctta tcattaatga tgatagtctt gttgaaaaag ttcgttattt cattaatttt 
12361 ggtatagaaa gctcagaatc aataccttac ttaggtacta atgctaaaat gaatgaattt 
12 421 gaggcggcta tgggactttg tgttctagat gatattatag aaattaagag caaaaggaaa 
12481 gttattacag agatatatga ggctgggtta gatggattgg taaagtttca agaacagaat 
12541 cagcattcta gtaggaatta tagctatttt ccagtaatat ttaggactga ggaggaactt 
12 601 ctcagagtac agaaagcact aatacaaaat gatataatat cgcgtagata tttttatcca 
12661 tcattagata gtcttagtta tatagagcca aagcagtata tgccaatctc aagagatata 
12721 tctaaaagaa tattatgttt gccaatttat gcagagttag aagacgataa aattaataaa 
12781 ataattaata atatcaaaga ggtttcctca tgaaaaaaat atttgttgtt acagataata 
12841 gaactattct aagtgatttt aaaaatatca ttggtagtaa aaatgatgta caggttgatt 
12901 atttttgtag tttcaagagt caaacttctt ttgccaaaga aatatataac agtgagatta 
12961 agccaataga tatgaaaaaa aatggcaatg atcttattgg taagtatgat ttaggttttt 
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Fig.4 (Cont IV). 

13021 cttgtcattc gaaacaatta tttccagcaa aattagttaa ttcagtatta tgtataaata 
13081 ttcatcctgg acttaatcca tataatagag ggtggtttcc acaggtcttc tctattataa 
13141 ataaactacc tataggagca actattcatg tgatggatga agagatagat catggagata 
13201 taatcattca ggaagaagtt gaagttaatt ctttcgaaaa ctcttttgat gtttatgcta 
13261 aagttcaaaa aaaagaagtt gagttgttca ctaaagtcat agatgatatt ttgaataata 
13321 agttcactcg aatcaaacct aactccgaag gcaactataa ttcaattcat gattataaaa 
13381 acatgtgtga aattgattta gataaaatag taacaatgcg ggaagcaatt gactatctaa 
13441 gggctatgac acaccctcca tataaaaata gttatttcat tgatgagcat ggaaataaag 
13501 tatttgttgc tcttgaactt gaaaagataa gttagaaaaa tgagccttaa aaaaaataca 
13561 atatcaaatt atataacaca actatatact agcttaattg gtattgttat acttcctttg 
13621 tatttacaac atttaagtca tgatgcattt ggtctgattg gtttttttac agtttttcaa 
13681 acgtggttac ggttgttgga tgttggtata acaccaactt tatcaagaga agtggctcat 
13741 gttagaggta gtactgatga ctatcattac ttacgcaagt tggttagatc gttagagcta 
13 801 tttttcatta ttgttggtgt tctggtattt attgtaatta gtacacattc aaggtatata 
13861 tccacctctt ggttacatat aggctcgcta gatgctgata gtgtaagtgt atgtattgca 
13921 cttatgggtt taatgtttgc attaagatgg gtgtctgatc tatatggtgg tggtttgcgt 
13981 ggctttgaaa gacaggttct ttataataat ttaagtatca tacaaacgac actacagttt 
14041 attggtggat tattatttat ctgctatgtg tctactaata ttatgtatta ttttgtatat 
14101 cagacaataa ttgcgatact atatctagta tgtattgcaa ttgcatttta taaaatacta 
14161 ccatcatcat ttagcgtggg tttaaggttt gattttaaaa taattagaaa agtgcttcca 
14221 tttgcactag gcattgcata ttctacaaca gtttggatta ttgtcactca atctgataaa 
14281 ttagtgttct cacatgtatt. accattatct gagtatggtt atttatcttt attgatagtg 
14341 atatctagtg ctgttacgat attgtcctct ccgattagca tagctattca gcctagaatg 
14401 acaatgctat tagcccaaca aaatgtaaaa ggaatggaaa gcttatattt aaaatcatcc 
14461 ttgatctcaa ttactttttt atctgctgta gtaacatgtg ttttgatgta ttctcatcag 
14521 ctgttgcagt catggacagg aagtatggaa attgctaatt ggggtagtaa tatcttaaat 
14581 atatatgttt tatcagcatc tattatttgt ataatatcat ttcaatattt tttacagtat 
14641 gcttatggta agttaaagct acataataca tataatacaa ttagtttagt attttttgct 
14701 cctatagtta tatatactgc ttataattat ggagtgtata ctacagcact attatggctt 
14761 ggatatgcta tagtggggct gataatctgg atgcctattg tacaccatgt atttgctaaa 
14821 ggtatcaata ggtatttttt tataaattta gcagttatta ctatagtatg ttttttatta 
14881 tcgttaatat ttaagggttg gtatatttat ccaagtaaaa ttgggttggt agaattaata 
14941 ttgattgggt. ttgcattttt atttatacaa atttgtatag agtatgtttt gtttcggtac 
15001 aaggttttga ggtgtataga tgattaaagt ttcagtatgt gtgatgacat acaatcaaga 
15061 aaagtatatt ggtcaatgtt tagagtcttt ggttactcaa gagactgatt ttgactttga 
15121 gataatcgtit ggagatgatt tttctacaga tggtacaaga gatgttattc aagagtatca 
15181 aaaaaagtat ccggatatca taaagccagt ttttagagat aagaatgtgg gaattactga 
15241 aaatattaaa gaaatctatt ttgttgcaaa tggtgagtat atagctcata tggatggtga 
15301 tgattatgca ttgcctggta aacttcaaat tcaggctgat tttttggata ataatccaag 
15361 atgtacggga gtttttcata atataaatat actctatcca aatggtaata tacaacatag 
15421 taggtttgct tgttcaaata agagtatatt caatttatca gacactttac gcggagttgc 
15481 tgttggtgca aatagttcaa aaatgttcag aacatcggtt ttggatgatt tgattttacc 
15541 ggatatagag cttctagatt attattttca tgttataaca gcagaaaaag gttatttaag 
15601 ttttttaaat tctaatgaat cctatagtgt gtacagaaaa ggtattggta tcacatctaa 
15661 gtctaaggaa aaaatctata atacttatgc tggattattt gaatattttt tggatagata 
15721 tcctgaagag aaattaaata tttgtatccc tgttgtgcaa atgataattt cggctattaa 
15781 agggagatgt tttattagtg ctattcgtct attcaaaatt ttaattagat caagatgtat 
15841 tccattagta agttggttta aatatagatt tgaaaaataa atatcattta gaggattatg 
15901 tgaaatgaag ggaataattc tagctggtgg cagtggtaca aggctatatc cacttacctt 
15961 gggtgttagc aaacagctgc tacctgttta tgacaagcca ttgttatact atccactatc 
16021 tgtgcttatg cttgcaggta ttagggagat attaattatc tctacagtgc gtgatatctc 
16081 acttatccaa gagcttcttg gtgatggttc acaatttggt atacagttga gttataaaat 
16141 ccagccatca ccagatgggc ttgctcaagc atttattctt ggtgaggagt ttttggcggg 
16201 tgactcagct tgtttgatat taggagataa tatctactat ggtcaaggta tgactacaat 
16261 gctagagtct gcaagagcac agtgtggagg tccagctggt ggcgcttgtg tttttggtta 
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Fig.4 (Cont V). 

16321 ttatgttaat gatccgcata gatatggtat agtcgaattt gataagcaaa aaaatgtaat 
16381 ttcggtagag gaaaagccac agaatcctaa gtcacactat gctatcacag gtttatattt 
16441 ttatgataat aatgttgttg agtatgctaa acaagtcaaa ccatctgcac gtggtgagct 
16501 agagattact tcacttaatg agttatatct aaaagaaaat aagctaaatg tcgaactctt 
16561 agggcgtggc tttgcttggc ttgatgctgg tacgcatgat tcattgctag aggcaggtca 
16621 atatgtcgca actattgaga aaagacaagg gcttaaaatt gcatgtttgg aagaaattgc 
16681 atggcgtaaa ggctttatct caacacaaca agttctagct caagctgaaa aactttctaa 
16741 gacagagtat ggtcagtatc tgaagaattt aattaaggat ggtttataaa ttaatccgtc 
16801 atacccatga aggtgggtat ctcataaaag ttggatatgt tttggagatt ccaatctgcg 
16861 cagtaatgac aggtttggta atatatagcg atgttttaca atgactaaaa atggttttat 
16921 gtatattctt acaaataagg ataatactgt tctgtacata gttgtaacat ctaatttgat 
16981 aaaaagaatg tatgagcata aacatagcct tgcagatggt tttactaaaa atataatgtt 
17 041 aataagttag tttattttga aatttatgaa gatataaaag cagcaattct gtgagaaaag 
17101 cagttgaaaa aatgaaacag atcttggaaa gaacgaatta ttaatgagat gaatccgaat 
17161 tggaatgatt tatatgaatt aatatgtgag taaaactttt gtcttactgg tgcagatagg 
17221 tatctctaaa tatcagatgt gattgggaga ttaccgccta cgcggtaatg acaagtttat 
17281 gcggtaatga tagtttagtg agagaatgac tagtcactat aggaatgatg atgtaatgag 
17341 gaatgaaaaa atgaactaca aaccaaaaaa tatcctagta acaggtgcgg cgggatttat 
17 401 tggtagtaac tatgtgcgta tgatgttatc acgctatagt gatatcaaaa taatctcgta 
17461 tgataagctt acttatgcgg gtagtttaga taatctaaaa gacttgaata atgaacataa 
17521 ccatactttt ataaaaggtg atatttgtga tgaagtttta gtatatcaaa cactgaaaga 
17581 atataaaatt gatacgatag tacattttgc tgcagaatcg catgttgata attcaattgc 
17641 taatccaaag gtatttttag aaacgaatgt gataggtaca tttacacttt tagattgtgc 
17701 taaaaggtat tggttagatg agctaggttt agaagaaact agttgtaggt ttcatcatgt 
17761 atctactgat gaggtatatg gtaccttggc aaaagatgaa ccagccttta ctgagattaa 
17821 ggcttatgag ccaaattcac cgtattcggc atctaaggcg ggatctgatc atatttctag 
17881 agcatatcat catacctata aacttccggt aacaatttca aattgttcaa acaactatgg 
17941 accataccaa catcgagaga aattaatccc tgtagtgata aatagttgta taaactacaa 
18001 gcctattcct gtttacggag atggttcgaa tattcgagat tggctatatg tagaagatca 
18061 ctgcgatgct atccagacaa ttgttgagaa aggagtggtt ggagaggttt ataatattgg 
18121 tggtattaat gaagttgata atctaacctt ggtaaaaact atctgtaaac taatggatga 
18181 atataaacca gaaaatgctc cacattctaa cttaatcaca tttgtggaag atagaaaagg 
18241 acatgattgg cgttatgcta ttgataacag caagattcag aatgagttag gatggaagcc 
18301 atcacaagat tttgataaga tgtttagaca aactattgag ttttatctat agcttaaata 
18361 tttatcttat gagtatctct aaaaaatcaa tttaatttat ttttgtgtta aaaagtagtg 
18421 ttcgcaagaa tatagttaat ccgaaagata tttgtagaaa aagatatttg tagaaatgtt 
18481 ataatgtcta ataaaaatgc catcatatag ccaagatttt agagacatcg taattaataa 
18541 acatgaagaa ggtatgacgg agttcgagct gagtaagttt tttaacatag ataagcgtac 
18601 agttgtttca tggatagagt tttataaaag aaccggagat tatagttcaa agcaaggagt 
18661 tggttgtggc agagtcgcta gctttaccga taaaacattg attgaacagt atttgataga 
18721 tcatccagat gcaagtgcat tagatataaa agaagcatta gcccctgata ttccaagaag 
18781 tacattttat gattgtctta atagacttgg ttttagtttt aaaaaaagac tccaaaatat 
18841 aagcaaagaa aagaacatga aaggttggag tatatagaaa aactaaaaga aatagccaat 
18901 aaatttgatg tacaaatatt atatctacct ccgtactctc cagatttaaa tcctattgaa 
18961 aaggtttggg ctaactatta aaaaaatatt tagaaaagtg aataatagtt ttgaaaaatt 
19021 ttgtgatgct atctcttatg tgtttaacaa aatactctcg gattaactat atcatgctgc 
19081 taaaatattc ttggtattct ctggtcaaaa ctgacataat gatgctctac tttgtataag 
19141 gtttgctaca aatattatct aaacaaacat acaaaggtaa tttttagaga tcctattata 
19201 aacctactat ctaaatttag taagttaagt tatgacaata tttaatttgc tgatttattg 
19261 ttgaatatat tagctttcta tataattaat caatatcaaa gttatttagg tttttataat 
19321 gattactcct attatcttat ctggaggatt cggctcaagg ctatggccac tatcacgaga 
19381 ggcatcgcca aagcagttta tcggcttggt tgatgaacat agtctattag aaaatacaat 
19441 taagcgacta gataatgtca aggatataac ttcacctgta gttgtctgta atgaaagtca 
19501 tagattccaa gttgctgaag tgttgcggaa aatcaataaa aaaggcgata tactcctaga 
19561 gccattagcc agaaatactg ctccagcaat tgcacttgca gcactacatt tagctattaa 
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Fig.4 (Cont VI). 



19621 
19681 
19741 
19801 
19861 
19921 
19981 
20041 
20101 
20161 
20221 
20281 
20341 
20401 
20461 
20521 
20581 
20641 
20701 
20761 
20821 
20881 
20941 
21001 
21061 
21121 
21181 
21241 
21301 
21361 
21421 
21481 
21541 
21601 
21661 
21721 
21781 
21841 
21901 
21961 
22021 
22081 
22141 
22201 
22261 
22321 
22381 



tgatccaaat 
ttttcatcaa 
ctttggcatt 
gactactgta 
tgcacaagag 
agctagagtg 
aaaaacttat 
tgccctagtt 
tatagtgcct 
tattgctgca 
caaaaatagt 
aataattgtt 
caaaaaaata 
aatttataaa 
ggcgattcaa 
gcattggatt 
tagaccaaat 
caagattcca 
tgttagacta 
acaaactata 
tagaggtctt 
tcaattccta 
tctacgtgag 
tcatgagcca 
taatcagata 
taagtttaat 
aactatcagc 
attaccagag 
ccctaacaac 
agagatagtc 
cgaaaaattt 
gcagtgggca 
gccactagtt 
agctaaatat 
aaagataggt 
tgcaatgaat 
aggatttcta 
aagagatgct 
cgtgtcagag 
ttttgcttcc 
tttagataaa 
tacagatggt 
taatgctcca 
aaatcaatat 
ctaacttttc 
aaatgtcatg 
gaaactagtt 



acaattatgc 
gctatcgaaa 
acaccaactt 
aat.ggagt.tt 
tatttagata 
tatttagagg 
caaaagtcac 
caatcacagt 
atgcaacaaa 
aaagatagtt 
tatttacgct 
gaaacagcag 
gtcgaagttt 
ccttggggtt 
cttgaaccgg 
gtgatttctg 
gagtctgtat 
gttattctta 
gatacaagta 
ataaaagaaa 
gtttcagcta 
gagcaaaaat 
agtagcccta 
atatactgtg 
ccgtcagtta 
actccatatg 
attgatgaaa 
cctagtaagc 
tgtctagcag 
aaagagattc 
gtatctgtag 
agcaagtata 
agtgatgagt 
ctccaagcca 
tattttagta 
gaattactct 
ttggctagtg 
gttataccaa 
cttttatttg 
gagaaaagcc 
attatatcgg 
gttagagtaa 
gagcttaggt 
tgtgtggatt 
tcttcaccat 
ctggatttat 
caggatgaca 



tagttttagc 
aagcacagca 
gtcctcatga 
ataaggtaga 
gtggcaaata 
ttttagagaa 
agcaggattt. 
caatagacta 
gtggctggtc 
gtggtaatgt 
cgcatgatcg 
atgctatact 
tgaaaattca 
cagcgacaat 
gcaagaagtt. 
gaactgctac 
atataaaaat 
tagaagtaca 
gttaatataa 
taatcaaatc 
tgacagataa 
actctattgc 
gaataacaac 
gtgagatacc 
tggttactgg 
gtgaagttct 
gtatttttga 
aagcatatac 
gtaagactat 
tagagaaact 
ataccgaggc 
aagttgatag 
atggcaattg 
atgttatcgt. 
acgtgattag 
caaataatca 
atatttgtaa 
tgttggctgt 
atttgccatc 
aagaaatctt 
agcattttga 
ctttgacaaa 
gctatacaga 
tgattaacaa 
tgaaccatta 
ttcagcgttt 

g 



tgctgaccat 
aaaagttatt 
aggctatggt 
taaatttgtt 
ctattggaat 
gttacagcca 
agattttgtg 
cgcagttatg 
tgatgttggc 
ggttattggc 
tttattggct 
tgtcgcggat 
gcagcgaagt 
attagaggat 
atcattacag 
ggtaactatt 
aggcgaatct 
agttggagaa 
aaacaattag 
tagcggcgta 
gatctgttgg 
taagggtact 
agttgttatt 
atcaccagct 
tagtcatatt 
caaagaagat 
taaaaatggc 
acagtatatt 
agggctttat 
aggtgctaag 
aattcgccag 
tatagtttca 
gctaaaaggt 
gacaccagta 
aactaaaata 
aaatgctgtg 
agatgataaa 
aatgatgcta 
tcgatataca 
gaagtcaata 
tggtaaaaat 
tcaagatatt 
ggcagctagt 
aaacatttga 
ctaaccttat 
cattataaat 



catattgaaa 
aaagatgatt 
tatattaaac 
gagaagccta 
agcggtatgt 
gagatttaca 
cgttttgata 
gagaaagcaa 
tcttgggact 
gatgtgatta 
gcagtcggag 
aagaacaaaa 
gaattattac 
aagtctggtt 
caacattatc 
ggtactacta 
cacagacttg 
tatataagtg 
atagaaaaaa 
aagtttggta 
ctttatacaa 
aaaattgcta 
aaagctatca 
gtaatgctat 
ccagaggata 
gaagaaatga 
atgtttttac 
gacaggtatg 
cagcactcat 
gttatcttgc 
gaagatgtaa 
actgatggcg 
gatattttag 
agtagcaata 
ggctcgccgt 
gttggatatg 
actctaaaag 
tctatcaact 
gcaagtagta 
ttagcaggtg 
agcattgaaa 
atccatctta 
gatgagcagg 
agatcagtca 
ctatagctag 
atcaatttta 



atctggagat 
ctttagttac 
aaggggtaca 
gtgtggtcgt 
ttatgttcac 
gaggatgtga 
aacaaagctt 
ctaatgttgc 
ctttgtatga 
ctagtaatgt 
ttaatgattt 
ctcaagatgt 
agcataagca 
ataagataca 
accgtagtga 
agtctattgt 
aaaataatgg 
aagacgatat 
atataatgag 
ctagtggagt 
aagcttttat 
tagctcatga 
tagatagtgg 
atggtatatc 
gaaatggtat 
ttgttagcca 
aaaaactaga 
tagatttttt 
ctgtaggacg 
tagaattttc 
agcttgctaa 
atgctgatag 
gtgtactgac 
ctgtggcaga 
atgtaattgc 
aggcaaatgg 
cgctgcctac 
ctaataaaac 
aaattgatga 
aatcagatct 
atatcgatac 
gaccatctgg 
caaaaagttt 
aaaatattcc 
ccacagataa 
ttgagatcct 
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Fig.5. 



SEQ ID NO 2 MNYHI KEVFWS 1 1 LS FLKSQKGI HTNDEAKLRLFI EAVFYVLRT 

GCQWRMLP FY Y GKYRS I HKR FKDWCDKDI FS RLFKS VQNPDLQE VMLDST IARAHACA 
TGYDKDDNQAIGRSVGRITTKIHAMTDAL<^PIEILLSEDKTHDSKVAIDLLKNVYNT 
KVIi^RAYHSNEIRQHIQGISSEAVIPCKSOTLNHIPFDSHVYKERHLIENFFSKIKH 
FRRVFSRFDKT I LAY I GMI KLACTFI WLR 



SEQ ID NO 3 MS FYDNRT LNFWI I VLT 1 1 TVNWT FY I FKQDVNLHFL LALVLL 

RCLS S FLLLRD YMASWRKSTQKT FLRKAFINLPVFFIVALFFYGKVT FSLIFS E FLFY 

VF LI S LS VY F YWYLMNRGS VDKS KTAVI Y GAGAAGTKIAQE LASAGYR I KC FVDDNE T 

LQKRSIDSKKVLSKAELTKLLLSSRFDLLVIALPRNANQWKNIYKEFEKDFNQIRIM 

P P LEE I LQDENFMS QLKPVS L YDLLARDTKS LDKE SISNFI KNKWLVT GAGGS I GSE 

IVHQCIKYQAKELILVDHSEFNLYKITEECSHFNINSVLCSVCDRKALAEVFQKYTPN 

I VFHAAAY KHVPLVE EN I S RAI RNNI LGT KNAI DLAIEAGVE S F I L I S TDKAVR PTNV 

MGATKRVCELYLQNVDPKhTTKLAAVRFGNVLGSSGSVIPKFEEQIRKGGP^^ 

TR YFMLI PEACELVLQAGAIAKNSEVFVLDMGQPVKI I DLAKQFIRLSGRGDI DIKIV 

GLRPGEKLYEELLIEEDDVSTDYKDIFIGRRTFYDINTLNQDIESLIKDDVDQLVILK 

KIVPEFEHRLNG 

SEQ ID NO 4 MF YEVFKRLLDI LLS FMGLLLLS PI FLI 1 1 FMIKKDSKGPI FFK 

QKRYGKDKQFFYIYKFRTMYVDTPKDMPTHMLQDPSKCITKVGGFLRKSSLDELPQII 
NILKGEMS IVGPRPALWNQDDLIAQRDKYGANAVPVGLTGWAQINGRDELPI PDKAKL 
DGDYVKNKSTWFDLKCI FLTVFS VFAKKGWE GGTGALGNKEDLK 



SEQ ID NO 5 MKKR I LVT GLS S Y I GNS FAAKYNSDF S I DKI S LRDVSWANI DLS 

GYDAVLHVAGIAHTSKDPKLKEKY YKINTQLTYDLAKQAKDQGVRQFVFLSS I IVYGD 
SAPIGQQKVITKYTEPKPDDFYGDSKLQTEIKLNSLASDDFNIAIIRPPMVYGEGSKG 
NY PKLVKLAKYTFI FPNINNQRS VI S I DNLSKE IAE 1 1 LQTKHGVFLLQDNE Y FCTSQ 
FIKNYRKDVLGKRTYLTKI FNPI IRLLAKKVDFINKVFGNLTYEK 



SEQ ID NO 6 MRSKLLFIANDFDIVIYRFRREVIESFAAKEYEIVLVTPYSKKA 

EVFCKSLGVKYINVTDIDRRGKNPFKDLLLLFNYFKIIKKEKPDYIFSYTIKPNLYVGL 
VNLFFRKKFYPNVTGLGSVFANHGIVQKFIISLYKLSFKSTTKVFFQNEQNKKLFIAK 
KI ISGEKS ILLPGSGVNLDENKYVDYPKDQGI LKFVFLGRIMKEKGI YELLEAFAI LE 
KKYKNISLDIYGFCDENKSNFMGKVNTIKSVKFYGFTDNTKEKIASAHAVVLPSYH 
MSNVLIiEAAAI GR PVIASD I PGCRE I FDDGLS GLS CNPNDVS S LRNS LEQ FINMS YTD 
KIAMS YKARAKIEKDFDRS I WNAYLQQN 

SEQ ID NO 7 MS LYE DIVAKREKVS LVGLGYVGLPI AIAFAKKIDVLGFDI CET 

KVQHYKDGFDPTKEVGDE^VRNTTMKFSCDETSLKECKFHIVAVPTPVKADKTPDLT^ 
IIKASETVGRNLVKGAYWFESTVYPGVTEDVCVPILEKESGLRSGEDFKVGYSPERI 
NPGDKVHR LET 1 1 KWS GMDEE S LDT IAKVYE LWDAGVYRAS S I KVAEAAKV I ENS Q 
RDVNIAFVNELSIIFNQMGIDTLEVLAAAATKWNFLNFKPGLVGGHCIGVDPYYLTYK 
AAELGYHSQVI LSGRRINDSMGKFWENLVKKLISADI PVKRARVAI FGFTFKEDCPD 
TRNTRVT DMVKE LNE YGI E P Y 1 1 DPVADKEEAKHE YGLE FDDLS KMVNLDAI I IAVSH 
EQFKDITKQQFDRLYAHNSRKIIFDIKGSLDKSEFEKDYIYWRL 
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Fig.5 (Cont I). 

SEQ ID NO 8 VAYDNVKFPHGS FFLVTGGAGFI GSNLCEVLLSKGYRVRCLDDL 

SNGHYHbTVTiPFLTNSNYEFIKGDIRDLDTCMKACE GIDYVLHQAAWGSVPRSIEMPLV 
YE D INVKGALNMLEAARQNNVKKFVYAS S S SVYGDE PNL PKKE GRE GNVLS PYAFTKK 
ANEEWJWLYTKLYGLDTYGLRYFWFGRRQDPNGAYAAVIPKFIKQLLNDEAPTINGD 
GKQSRDFTYIE>A^IEANIJ^CIADSKYAGESFNIAYGGREYLIDLYYNLCDALGKKIE 
PNFGPDRAGDIKHS^O^DISKARNMLGYNPEYDFELGIKHAVEWYLIN 



SEQ ID NO 9 VY I KKVS FKI L YL Y LLAFC 1 1 FS LE FKFAI LN I IVY L P AC I LGF 

LAIiKKLFVGNI VKKQLAFL F FF FFLSMI Y LI I VQ 1 1 L LDAAS L FPQ FL FNI L IAI GFC 
NFI FVSYDNNENYFFNMSKI I FFVTFLQS I FVFLSRYY I FLNDWI FFFLVKKGNIE IS 
NVIE YKLRVFGLSNAGGDGLGFS ITI GLCFSI FYFIKYIKGKS I FTKLMLFVPLILIV 
FSNI FISRTSLLTSSLI LLITIFYI Y IKKEKLLFI 1 1 LALFFLS IWI LFKLNLNLSWA 
FENIYSYIQSGDFSHGSLSVLINKMLFVPDNLLTWIFGCEDVSNTDIGYIKYLYYYGI 
I FSMFFY I LI I FLY FEMRKC FI FSEYRS L FLLLLI VCLVFQAKI I FLTVGLFTKLT 1 1 
LFI FSLKENS FTTRSVI 



SEQ ID NO 10 LKRFVHLI INLNQGGAETML YKLCKSMDKS I YHITI I S LMGRGV 

FANKLEAYG\H<VYTLNLNKFNVLFVL FKY I KI IRRI KPDVI HAWMYHANVISILCKPF 
YRKTKYINSIRMGLENYDGHKNLTKFMIKLNAKFSKFSDLTLNNSKKS LEDHQNIGFK 
NQCFIANGFDKDVFKPS FLKYEKFRLNNDLDDNVKI IGI IARNHADKNISRFLQIANL 
LLKSNPSLRFLIAGRECSKIDIGSYLDNKSNVNKFFVFESVDSSEYLPVLDLYLSTSK 
VEGFPNILAEAMLCEVPIVASNVGDCKDI LNGYGEVFELSQGNKEI IEKIMKVLETTV 
VMKKRMRE Y 1 1 NNFS I E AI LEKHEKL YHE GS V 



SEQ ID NO 11 MCGWGFY S FNKEE GFDS I INQS LLS I KHRGS DDS GYWCDNQVT 

LGHTRLSIHDITNAGHQPMLSNSGNTAIVFNGEIYNYLSIKNQLLSEYSNLKFKSNSD 
TEVLVNAIELWGIDKTLEKCICMFAFGVYSRKTSCLILARDRFGEKPLYFGIQNGILG 
FASELKALKPLKE CGWRFDI DRDALATYMRYAYVPTP YS I YKNI SKLNVGS YI KFDAK 
GNSKEYKYWDSKKVLDSEKYKDSYDQAILDLEIKLKSTLSIQMQSDVPLGAFLSGGID 
STTWALMQSMSKDKINTFS IGFNQKE YNEAEHARAVAKHI GTNHTDMYVTERDALDV 
IPKLAGIYDEPFADSSQIPTYLVSKIAKSKVTVALSGDAGDELFGGYNRYFLAPNIAK 
KIKFAKLLKYAPDAWIKKAEILNFGKFALLADKLLKIJCRVL 

NDTS FVLGAKE YD I LRDKNI YDI PQLS FQEWMMFVDSNTYMI DD I LVKVDRAAMANS L 
ETRVPFLDHNIYEFAYSLPIDYKIQRC^GKRILKDLLYKYVPESLVNRSKMGFGIPLA 
KWLRE DLREWADNLLDYSKI DKQGYLS PEWQKYWQEHLSGKRNWQAI LWNI L I FQEW 
LDNE 



SEQ ID NO 12 MSKVNVTKPYLPDINKYKSYVNKIYKNGWLTNNGPLVQELEKRL 
AKYLGVKNIVLVSNGTIALEIAYRALGVKGSAJTTPFSFVAT^ 
DENTLSIDVSKIKYAIEEDTSAIVPVHVFGNGCEVEKIDMLA^ 

VKYKGESILNYGDISTLSFHATKIFHSIEGGALIINDDSLVEKVRYFINFGIESSESI 
PY LGTNAKMNE FEAAMGLCVLDDI I E IKSKRKVITE I YEAGLDGLVKFQE QNQHS S RN 
YSYFPVIFRTEEELLRVQKALIQNDIISRRYFYPSLDSLSYIEPKQYMPISRDISKRI 
LCLPI YAELEDDKINKI INNIKEVSS 



SEQ ID NO 13 MKKI FWTDNRTI LSDFKNI I GS KNDVQVDYFCS FKSQTS FAKE 

IYNSEIKPIDMKKNC^nDLIGKYDLGFSCHSKQLFPAKLVNSVLCINIHPGLNPYNRGWT 
FPQVFS 1 1 NKL P I GAT I HVMDEE IDHGDI I IQEEVEVNS FENS FDVY AKVQKKE VE L F 
TKVIDDILNNKFTRIKPNSEGNYNSIHDYKNMCEIDLD 
KNSYFIDEHGNKVFVALELEKIS 
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Fig.5 (Cont II). 

SEQ ID NO 14 MSLKK2STTISNYITQLYTSLIGIVILPLYLQHLSHDAFGLIGFFT 

VFQTWLRLLDVGITPTLSREVAHVRGSTDDYHYLRKLVRSLELFFIIVGVLVFIVIST 
HS RY I STSWLHI GS LDADS VSVC IALMGLMFALRWVSDLYGGGLRGFERQVLYNNLS I 
IQTTLQFIGGLLFICYVSTNIMYYFVYQTIIAILYLVCIAIAFYKILPSSFSVGLRFD 
FKIIRKVLPFALGIAYSTTVWIIVTQSDKLVFSHVLPLSEYGYLSLLIVISSAVTILS 
SPISIAIQPRMTMLLAQQNVKG^SLYIJCSSLISITFLSAVVTCVmYSHQLLQSWTG 
SME IANWGSNI LNI YVLSAS 1 1 CI I S FQYFLQYAYGKLKLHNTYNTI SLVFFAPIVI Y 
TAYNY GVYTTALLWLGYAI VGLI IWMPI VHHV FAKGI NRY F FINLAVI T I VC FLLS LI 
FKGWY I YPSKI GLVELI LI GFAFLFI QI CIE YVLFRYKVLRCIDD 



SEQ ID NO 15 MIKVSVCVMTYNQEKYIGQCLESLVTQETDFDFEIIVGDDFSTD 

GTRDVIQE YQKKYPDI I KPVFRDKNVGI TENI KE I YFVANGE YI AHMDGDDYALPGKL 
QIQADFLDNNPRCTGVFHNINI LYPNGNIQHSRFACSNKSIFNLSDTLRGVAVGANSS 
KMFRTSVLDDLILPDIELLDYYFHVITAEKGYLSFLNSNESYSVYRKGIGITSKSKEK 
I YNTYAGLFE Y FLDRYPEEKLNICI PWQMI I SAIKGRCFI SAI RLFKI LIRSRCI PL 
VSWFKYRFEK 



SEQ ID NO 16 MKGIILAGGSGTRLYPLTLGVSKQLLPVYDKPLLYYPLSVLMLA 

GI RE I LI I STVRDI SLIQELLGDGSQFGI QLS YKI QPS PDGLAQAFI LGE E FLAGDSA 
CLILGDNIYYGQ(31TTMLESARAQCGGPAGGAC^FGYYVNDPHRYGIVEFDKQKNVIS 
VE EKPQNPKS H YAI TGL YFY DNNWE YAKQVKPSARGE LE I TS LNE L YLKENKLNVE L 
LGRGFAWLDAGTHDS LLEAGQYVAT I EKRQGLKIACLE E IAWRKGF I STQQVTiAQAEK 
LSKTEYGQYLKNLIKDGL 



SEQ ID NO 17 MTSHYRNDDVMRNEKMNYKPKNILVTGAAGFI GS NYVRMMLSRY 

SDIKIISYDKLTYAGSLDNLKDLNNEHNHTFIKGDICDEVLVYQTLKEYKIDTIVHFA 
AESHVDNS IANPKVFLETNVIGTFTLLDCAKRYWLDELGLEETSCRFHHVSTDEVYGT 
LAKDE PAFTE I KAYE PNS PYSASKAGSDHI SRAYHHTYKLPVTI SNCSNNYGP YQHRE 
KLIPVVINSCINYKPIPVYGDGSNIRDWLYVEDHCDAIQTIVEKGVVGEVYNIGGINE 
VDNLTLVKTICKLMDEYKPENAPHSNLITFVEDRKGHDWRYAIDNSKIQNELGWKPSQ 
DFDKMFRQTIE FYL 



SEQ ID NO 18 MPSYSQDFRDIVINKHEEQ4TEFELSKFFNIDKRTWSWIEFYK 

RTGDYSSKQGVGCGRA/7VSFTDKTLIEQYLIDHPDASALDIKEALAPDIPRSTFYDCLN 
RLGFS FKKRLQNISKEKNMKGWS I 

SEQ ID NO 19 MITPIILSGGFGSRLWPLSREASPKQFIGLVDEHSLLENTIKRL 

DNVKDITSPVVVCNESHRFQVAEVLRKINKKGDILLEPLARNTAPAIAI^ 
PNTIMLVLAADHHIENLEIFHQAIEKAQQKVIKDDSLVTFGITPTCPHEGYGYIKQGV 
QTTVNGVTKVDKFVEKPSVVVAQE YLDSGKYYWNS GMFMFTARVYLE VLEKLQPE I YR 
GCEKT YQKSQQDLDFVRFDKQS FALVQSQS I D Y AVMEKATNVAI V PMQQS GW S D VGS W 
DS LYD IAAKDS CGNWI GDVITSNVKNS YLRS HDRLLAAVGVNDLI IVETADAI LVAD 
KNKTQDVKKIVEVLKIQQRSELLQHKQIYKPWGSATILEDKSGYKIQAIQLEPGKKLS 
LQQHYHRSEHWI VI SGTATVTI GTTKS I VRPNESVYIKIGE SHRLENNGKI PVT LI EV 
QVGEYISEDDIVRLDTSS 
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Fig.5 (Cont III). 

SEQ ID NO 20 MRQTI I KE 1 1 KS S GVKFGTS GVR GLVSAMTDKI CWLYTKAF IQF 

LEQKYSIAKGTKIAIAHDLRESSPRITTVVIKAIIDSGHEPIYCGEIPSPAVMLYGIS 

NQIPSVMVTGSHIPEDRNGIKETTrPYGEVLKEDEEMIVSQTISIDESIFDKNCTlFLQK 

LELPE PSKQAYTQY IDR YVDFFPNNCLAGKTI GLYQHSSVGRE I VKE I LEKLGAKVIL 

LEFSEKFVSVDTEAIRQEDVKLAKQWASKYKVDSIVSTDGDADRPLVSDEYGNWLKGD 

I LGVLTAKYLQANVIVT FVSSNTVAEKIGYFS NVIRTKI GS P YVIAAMNELLSNNQNA 

VVGYEANGGFLLASDICKDDKTLKALPTRDAVIPMIAVMMLSINSNKTV 

R YTAS S KI DD FAS E KS QE I LKS I LAGE S DLLDKI I SE H FDGKNS I EN I DTTDGVRVTL 

TNQDI IHLRPSGNAPELRCYTEAASDEQ7VKSLNQYCVDLINKNI 
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Fig.6. 

SEQ ID NO 1 

atatttatttttgtgcacagaacctaatttgcatttttgtgcacaaagaaaatttttttg 
atataatagactttaataggatattttctaaaaattaacaaatgtctttctacgataata 
gaacgcttaatttcgtggtaataatagttttaactattattactgttaattggactttct 
atattttcaagcaagatgttaatttacattttttacttgcattagttttgctgagatgct 
tgtcatcttttttactacttagagattatatggctagttggcgtaagtcgactcaaaaaa 
cttttttacgtaaggcttttattaatttgccagtatttttcatagtggcattattttttt 
atggcaaagtcactttttcgttgatattctctgagtttttattttatgtttttttgatca 
gtttaagtgtctacttttattggtatttgatgaacagaggatcagtggataaaagtaaaa 
ctgcggttatttatggtgcaggtgctgcaggaacaaagattgctcaagaacttgcttctg 
ctggttatcgcatcaaatgttttgttgatgacaatgaaactttacaaaaaagaagtattg 
atagtaaaaaggttctatctaaagctgaattaacaaaactattgctatctagtagatttg 
accttttggttattgcattgccaagaaatgcaaaccaagtagtcaaaaatatatataaag 
aatttgaaaaggattttaatcagattagaattatgccgcctcttgaggaaattcttcaag 
atgagaattttatgtcacagttgaagcctgtttcactctatgatctattagcgcgtgata 
ctaagagtttagataaagaatctatctctaattttatcaaaaataaggtggtgctagtca 
caggagctggaggtagtataggttctgaaatagtacatcaatgtatcaagtatcaggcaa 
aagagttgatattggttgatcatagtgagtttaacttatataaaattactgaggagtgta 
gtcattttaatatcaatagtgtgctatgttctgtttgtgatagaaaagcattggctgagg 
tttttcaaaagtatactccaaatatagtatttcatgctgctgcctacaagcatgttccct 
tagttgaggagaatatctctagagcaattagaaataatatcttaggtactaagaatgcta 
tagatctggctatagaagctggtgttgagtcatttatattgatttccactgataaagcag 
tgcgaccaacgaatgttatgggggctaccaagagagtttgtgagctgtatttacagaatg 
ttgatcccaaaaataccaagcttgctgcagtgcgttttggtaatgtgcttggtagtagtg 
gcagtgtgattccaaaatttgaagagcaaataagaaaaggtggtcctgttacagttactc 
atcctgaaattacacgttattttatgttgataccagaagcttgtgaactggtcctacaag 
ctggtgctattgcaaaaaattcagaggtctttgtcttagatatggggcaacctgtcaaga 
ttattgatcttgctaaacaatttattagactttctggtagaggtgatattgatattaaaa 
tagttggtttgcgtccaggagagaaactttacgaagagcttttgatagaggaagatgatg 
ttagtaccgactataaagatatttttattggtagaaggactttttacgatattaatactc 
taaaccaagatattgaatcgttgatcaaggatgatgttgatcagcttgtgatattaaaga 
aaattgttccggaatttgaacatagattgaatgggtagtggttttatgttttatgaggtt 
tttaaaagattgcttgatattttactttcttttatggggttgttgttattaagtcctatt 
ttcttaattattatttttatgataaagaaagattcaaaaggacctatattttttaaacaa 
aagcgctatggtaaagataagcaatttttttacatatataagtttagaactatgtatgtt 
gatactccaaaagatatgccaacgcacatgttacaggatccatcgaaatgtataactaag 
gttggaggatttttaaggaaatcatctttagatgagttgccacaaattataaatattcta 
aaaggtgaaatgagcatcgtgggtccaagaccagcattatggaatcaagatgacttaata 
gcacaaagagataagtatggggcaaatgctgtgcctgtgggactgactggctgggcacag 
attaatggtagggatgaattaccaatacctgataaagctaaacttgatggtgattatgta 
aaaaataaaagtacatggtttgatttaaaatgtatttttttgacagtattttctgttttt 
gccaaaaagggcgtcgttgagggtggtactggagctttaggtaacaaagaggatttaaag 
tagtatgaaaaaaagaatcttagttacaggtttgagtagctatattggtaactcatttgc 
ggctaaatataactcagattttagtatcgataaaatatctttgcgcgatgtttcgtgggc 
aaatatagacttaagtggttatgatgctgtattgcatgtcgctggaattgcccatacttc 
aaaggatcctaaactaaaagaaaaatactataaaataaatacgcaattaacttatgatct 
ggcaaaacaagctaaagatcaaggtgttcgacagtttgtgtttttaagtagtattatagt 
ttatggtgatagtgcgccaataggtcaacaaaaagttataactaaatataccgaacctaa 
accagatgatttttatggagatagtaagcttcaaactgaaattaagctaaatagcctggc 
tagtgatgactttaatattgctataatcagaccaccaatggtatatggagaaggctcaaa 
aggcaactatccaaagttggttaaacttgcaaagtatacttttatttttcctaatattaa 
taaccaaagaagtgttatatctatagataatttatctaaagagattgcagaaataatttt 
gcaaactaaacatggagtttttctacttcaagataatgaatatttttgcacttcacagtt 
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Fig.6 (Cont I). 

tataaaaaactatagaaaagatgttttaggtaagagaacttatctgacaaaaatttttaa 
tccaattataagattgcttgctaaaaaagtagattttattaataaagtttttgggaattt 
gacttatgagaagtaagttattattcatagctaatgattttgatattgtaatatatcgtt 
tcagaagagaagtaatcgagtcttttgctgctaaagagtatgagatagtactagtaacac 
catattctaagaaagcagaggttttttgtaaaagtcttggtgttaagtatataaatgttg 
atatagatagacgaggcaaaaatccttttaaggatttgcttcttttatttaactatttca 
aaataataaaaaaagaaaaacctgattacatttttagctatacaattaaaccaaatttgt 
atgttgggttagtgaatttgttttttaggaagaagttttatccaaatgtaacaggcttag 
gaagtgtttttgctaatcatggtattgttcagaagtttataatatctttatataagttat 
catttaaaagcaccacaaaagtattctttcagaatgagcaaaataaaaagttatttatag 
ctaagaaaataatcagtggagaaaaatcaatattattaccaggttctggggtaaacttag 
atgaaaataaatatgttgactatcctaaagaccaaggaatattaaaattcgtttttcttg 
gccgaataatgaaagaaaaggggatttatgaattgttagaagcctttgctatacttigaga 
aaaaatataaaaatattagtcttgacatttatggtttttgtgatgaaaataaatctaatt 
ttatgggaaaggttaatacgataaaatcagtaaaattttatggttttactgataatacta 
aagaaaaaatagctagtgcacatgcagttgttttgccatcttaccatgaaggaatgtcaa 
atgtgctgttagaagcagctgcgataggtagacctgtaattgcgtcagatattcctgggt 
gtagagaaatttttgatgatggtctctctggcttatcatgtaaccctaatgatgtgagtt 
ctttacgtaactcattagagcagtttataaatatgt:cgtat.actgataaaatagct:atga 
gctataaagctagagctaagatagaaaaagattttgatagaagtattgttgtcaatgcat 
acttacagcaaaattaataataagggtttaaattatgagtttatatgaggatatagtcgc 
taaaagagaaaaggtttcattggttggcttgggttatgttggtttaccaatagctattgc 
atttgcaaaaaaaatagatgtgttaggatttgatatttgtgaaacaaaagttcaacatta 
taaggatggttttgatccaacaaaagaagtaggagatgaggctgtcagaaatacgacaat 
gaaatttagttgtgatgaaacaagtcttaaagagtgtaaatttcatattgttgcagttcc 
tacaccagttaaagcagataaaactcctgatttgacgccgattattaaggcaagtgagac 
ggttggtaggaatcttgtcaaaggcgcttatgttgtgtttgaatcaactgtttatcctgg 
tgttacagaagatgtttgcgtaccaatacttgaaaaagagtctggcttgaggtctggtga 
agatttcaaagttggttactctcctgagaggataaatcctggtgataaggttcataggtt 
agaaacaattatcaaagtagtatctggtatggatgaagagtctttagatactatagcaaa 
agtttatgagctagtagtagacgcaggagtttatagagctagtagtataaaagtggctga 
agctgctaaggttatagaaaactctcaaagagatgttaatatagcttttgttaatgagtt 
atcgataatatttaatcagatgggtattgatactctagaggttttagcagcagctgcaac 
taaatggaatttcttaaactttaagcctggtcttgttggtggacattgtattggtgttga 
cccatattacctaacgtacaaggcagctgagcttggatatcattctcaggtaatattatc 
tggtcgtaggataaatgatagtatgggtaaatttgtagttgagaatttagtcaaaaaact 
gatatctgcagatatacctgttaagcgagctagagtagcaattttcggctttacttttaa 
agaagactgtcctgacactaggaatactcgagttatagatatggtaaaagagctcaacga 
gtatggtatagagccatatattatagatccggtagctgataaagaagaggctaaacatga 
gtatggacttgagtttgatgatctaagtaaaatggtcaatctagatgcgatcattattgc 
tgttagtcacgaacagtttaaagatataacaaagcaacagtttgataggctatatgcgca 
taattctagaaagattatatttgacatcaaaggtagtttagataaatctgagtttgaaaa 
agattatatttattggagattgtagtggcttacgataatgttaaatttcctcatggttcg 
ttttttttggtgactggaggtgcgggttttattggctctaatttatgtgaagttttactt 
agtaagggttatagagttaggtgtttagatgatctctcaaatggtcactatcacaatgtt 
gagccgtttttaactaattctaattatgagtttataaaaggtgatattagagatttagat 
acttgcatgaaagcttgtgaaggtattgattatgttctacatcaagctgcttggggaagc 
gtaccaagaagtattgagatgccattagtgtatgaagatataaatgttaaaggtgcatta 
aatatgcttgaagcggctagacaaaataacgttaaaaaatttgtctatgcttctagttca 
tcagtatatggtgatgagccaaatttacctaaaaaagaaggtagagaaggaaatgtttta 
tcaccctatgcatttacaaagaaagctaatgaagagtgggcgagactatacacaaagtta 
tatggtctagatacttatggtctaagatattttaatgttttcggtagaagacaagatcct 
aatggtgcgtatgcagcagttatacctaaatttatcaaacagttattaaatgatgaagcg 
ccaactataaatggagatggtaaacagtcgagagattttacatatatagagaatgttatt 
gaggcaaatcttaaagcatgtttagcagatagtaagtatgccggagagtcttttaatata 
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Fig.6 (Cont II). 

gcttatggaggtagagagtatcttatagatttgtactataatctttgtgatgccttgggt 
aaaaaaatagagccaaattttggtccagatagagcgggtgatattaagcatagtaatgct 
gatatttcgaaggctaggaatatgctcggatataatccggaatatgattttgaattaggc 
ataaagcatgctgttgagtggtatttaattaattaaatggtattttaatcaagtgtacat 
aaaaaaagtgtcttttaaaattttatatttatatttactagctttttgtattatttttag 
tttagaatttaaatttgctatattgaatattatagtttatcttccggcttgtattttggg 
ttttttagctcttaaaaaactatttgtcggaaatattgttaagaaacaattagctttcct 
ttttttctttttctttttatcaatgatttatttaataatagtccaaataatcttacttga 
tgcagcatcattgtttcctcagtttttatttaacattttgatcgcgataggtttttgtaa 
ctttattt-ttgtttcatatgataataatgaaaattatttttttaatatgtctaaaataat 
attttttgttactttcttacaatctatttttgtatttctttcaaggtattatatattttt 
aaatgattggatattcttttttttagtgaaaaaagggaatattgagatttcgaatgttat 
tgaatataagttaagagtattcggacttagtaacgctggaggggatggtttaggattttc 
aattactataggattatgtttttctatattttattttatcaaatatattaaaggtaaatc 
tatatttaccaaacttatgctgtttgtacctttaattcttattgtgttttctaatatttt 
catatctagaacatcactcttaacttcttcacttatattgttaataacaatattttatat 
atatattaaaaaagaaaaattactgtttattataatattggcgctattctttttatcaat 
atggatattgttcaaattaaatttgaatttgagttgggcttttgaaaatatttactcgta 
cattcaatctggcgatttttcacatggaagtctaagtgttttaatcaataaaatgctttt 
tgtgccagataaccttttgacttggatatttggttgtgaggatgttagtaatactgatat 
tggttatattaaatatttatactattatgggattatatttagtatgtttttttatattct 
tattattttcttgtactttgaaatgagaaaatgttttatattttcagagtatcgatcatt 
atttctattgttgttaatagtatgtttagtttttcaagcaaaaataatttttttgacagt 
aggattatttactaaattaaccattatattatttattttttctcttaaagaaaacagctt 
tacaactaggagtgtgatttgaaaaggtttgtacatttaataataaaccttaaccaaggt 
ggtgctgaaacaatgctttataaactttgcaaatctatggataagtcaatatatcatatt 
acgattatatcacttatgggtaggggagtatttgcaaataagttagaagcttatggtgtt 
aaagtttatacattaaatttaaataaatttaatgtactatttgtattgtttaaatatatt 
aagattatcagaagaataaagcctgatgttattcatgcttggatgtatcatgcaaatgta 
atttctatattatgcaagcctttttatagaaagactaaatatataaatagtataagaatg 
ggattggagaattatgatggtcataagaatcttacaaagtttatgataaagttgaatgca 
aaattttctaagttctcagatttaacattaaataattcaaagaaatcattagaagatcat 
caaaatataggttttaaaaaccaatgctttatagcaaatggttttgataaagatgttttt 
aaaccgagctttttaaagtatgaaaaatttcgtttaaataatgatttagatgataatgtt 
aaaattataggtatcatagcaagaaatcatgctgataaaaatatttctcgtttcttacaa 
atagctaatttattgttaaaaagtaatcctagtt:tacggtttttaatt:gctggaagagag 
tgttcgaaaatagatataggtagttatctagataacaaaagtaatgtaaataagtttttt 
gtatttgaatctgtggattctagtgaatacttaccagtattagatttatatttgtctaca 
tcaaaagttgaaggttttccaaatatacttgcagaagccatgctatgtgaagttcctatt 
gttgcttctaatgttggagattgtaaagatatacttaatggatacggtgaagtttttgag 
cttagtcaaggtaataaagaaataatagaaaagattatgaaagttttagaaacaacggta 
gtcatgaaaaagcgcatgagagaatatataataaataattttagtatagaagctattttg 
gaaaaacacgaaaaactttatcatgagggcagtgtctaatgtgtggagtagtaggctttt 
actcatttaataaagaagaaggttttgactcaataattaatcaatcattgctttctataa 
agcatagagggtcggatgatagtgggtattggtgcgacaatcaagttactctggggcata 
ctagattatcaatacacgatataactaatgcgggacatcagccaatgttatctaatagcg 
gtaatactgctattgtgtttaatggagaaatatataattacttatccataaaaaatcagc 
tattaagtgaatattcaaatcttaaatttaaaagtaacagtgatactgaggttttggtca 
atgctattgaactttggggtatagataaaactttagaaaaatgcataggaatgtttgctt 
ttggagtttacagtagaaaaactagttgcttaatactagctagagatagatttggcgaga 
agccattatattttggtatccaaaatggtattttgggttttgcatcagaattgaaggcac 
ttaagccattaaaggaatgtggctggaggtttgatatagatagagatgctttagcaacat 
atatgaggtatgcttatgtaccaacaccatactctatttataaaaatatatctaaactaa 
atgtaggtagttacataaaatttgatgctaaaggtaatagtaaagagtataaatattggg 
attctaaaaaagtactagattcagaaaaatataaagattcgtatgatcaagcaatcctag 
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Fig.6 (Cont III). 

atttagaaattaagcttaaaagtacactatcaatacaaatgcagtcagatgttcctctag 
gagcatttttatccggaggaattgactcaacaactgtagttgctcttatgcaaagtatgt 
ctaaagataagataaacacttttagtataggt-tttaatcaaaaagaatataatgaagctg 
agcatgcaagagcagtagcaaaacatataggtacaaaccacacagatatgtatgttacag 
aaagagatgctcttgatgtaataccaaaacttgctggaatatatgacgagccctttgctg 
attcatcacaaataccaacgtatcttgtgagtaaaatagctaagtcgaaagtaacagttg 
cactatcaggtgacgctggtgatgagctctttggcggttataatagatactttttagcac 
caaatattgctaaaaaaatcaaatttgctaagttacttaaatatgcaccagatgcttgga 
taaaaaaagctgagatattaaattttggtaagttcgctttattagcagataaactactaa 
aactaaaaagagttctcgaaaaagcaaaaacaaataaagagctttatgtactactttgtt 
cacaaataaatgatactagctttgtgttaggagcaaaagagtatgatatattaagagata 
agaatatttatgatattccacaattatctttccaagagtggatgatgtttgttgattcta 
atacatatatgatagatgatatattggttaaggttgatagagcagctatggctaactctc 
tagagacaagagtgccatttttagatcataatatttatgaatttgcttattccttaccaa 
ttgactataaaatacaacgaggtaacggaaaaagaattttgaaagatttgttatataaat 
atgtgccagaaagtttggtcaataggtctaagatggggtttggtattccgcttgctaaat 
ggttaagagaagatttacgagagtgggcagataatttactggattatagtaaaatagaca 
agcaaggttacttaagtcctgaggtggtgcaaaaatattggcaagagcatttgagtggta 
aaagaaattggcaagcaatattatggaatattctaatttttcaggagtggttagataatg 
agtaaagtaaatgtaacaaaaccatacttaccagatataaataaatataaaagctatgta 
aataaaatatacaaaaatggatggcttactaataatggtccgttagtgcaagagctagaa 
aaaagacttgcaaagtatctaggtgttaaaaatatagttttagtatcaaatggtacaatt 
gcattagaaatcgcgtatagagcgttaggagtcaaaggaagtgcaattactactccattt 
tcatttgttgctactacatcttcattggtttctaacaatgtaaaaccagtgtttgttgat 
attgatgagaatactctaagtatagacgtctctaaaattaagtatgctattgaagaggat 
acttcagctattgtgccagttcatgtgtttggaaatggttgtgaagttgaaaaaatagac 
atgctggctaaaaaacataacttaaaagttatttatgatgcagcacatgcttttgatgtt 
aagtataagggtgagagtatattaaactatggtgatatttcgacattaagttttcatgca 
acaaagatttttcattctattgaaggaggtgcgcttatcattaatgatgatagtcttgtt 
gaaaaagttcgttatttcattaattttggtatagaaagctcagaatcaataccttactta 
ggtactaatgctaaaatgaatgaatttgaggcggctatgggactttgtgttctagatgat 
attatagaaattaagagcaaaaggaaagttattacagagatatatgaggctgggttagat 
ggattggtaaagtttcaagaacagaatcagcattctagtaggaattatagctattttcca 
gtaatatttaggactgaggaggaacttctcagagtacagaaagcactaatacaaaatgat 
ataatatcgcgtagatatttttatccatcattagatagtcttagttatatagagccaaag 
cagtatatgccaatctcaagagatatatctaaaagaatattatgtttgccaatttatgca 
gagttagaagacgataaaattaataaaataattaataatatcaaagaggtttcctcatga 
aaaaaatatttgttgttacagataatagaactattctaagtgattttaaaaatatcattg 
gtagtaaaaatgatgtacaggttgattatttttgtagtttcaagagtcaaacttcttttg 
ccaaagaaatatataacagtgagattaagccaatagatatgaaaaaaaatggcaatgatc 
ttattggtaagtatgatttaggtttttcttgtcattcgaaacaattatttccagcaaaat 
tagttaattcagtattatgtataaatattcatcctggacttaatccatataatagagggt 
ggtttccacaggtcttctctattataaataaactacctataggagcaactattcatgtga 
tggatgaagagatagatcatggagatataatcattcaggaagaagttgaagttaattctt 
tcgaaaactcttttgatgtttatgctaaagttcaaaaaaaagaagttgagttgttcacta 
aagtcatagatgatattttgaataataagttcactcgaatcaaacctaactccgaaggca 
actataattcaattcatgattataaaaacatgtgtgaaattgatttagataaaatagtaa 
caatgcgggaagcaattgactatctaagggctatgacacaccctccatataaaaatagtt 
atttcattgatgagcatggaaataaagtatttgttgctcttgaacttgaaaagataagtt 
agaaaaatgagccttaaaaaaaatacaatatcaaattatataacacaactatatactagc 
ttaattggtattgttatacttcctttgtatttacaacatttaagtcatgatgcatttggt 
ctgattggtttttttacagtttttcaaacgtggttacggttgttggatgttggtataaca 
ccaactttatcaagagaagtggctcatgttagaggtagtactgatgactatcattactta 
cgcaagttggttagatcgttagagctatttttcattattgttggtgttctggtatttatt 
gtaattagtacacattcaaggtatatatccacctcttggttacatataggctcgctagat 
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Fig.6 (Cont IV). 

gctgatagtgtaagtgtatgtattgcacttatgggtttaatgtttgcattaagatgggtg 
tctgatctatatggtggtggtttgcgtggctttgaaagacaggttctttataataattta 
agtatcatacaaacgacactacagtttattggtggattattatttatctgctatgtgtct 
actaatattatgtattattttgtatatcagacaataattgcgatactatatctagtatgt 
attgcaattgcattttataaaatactaccatcatcatttagcgtgggtttaaggtttgat 
tttaaaataattagaaaagtgctt.ccatttgcactaggcattgcatattctacaacagtt 
tggattattgtcactcaatctgataaattagtgttctcacatgtattaccattatctgag 
tatggttatttatctttattgatagtgatatctagtgctgttacgatattgtcctctccg 
attagcatagctattcagcctagaatgacaatgctattagcccaacaaaatgtaaaagga 
atggaaagcttatatttaaaatcatccttgatctcaattacttttttatctgctgtagta 
acatgtgttttgatgtattctcatcagctgttgcagtcatggacaggaagtatggaaatt 
gctaattggggtagtaatatcttaaatatatatgttttatcagcatctattatttgtata 
atatcatttcaatattttttacagtatgcttatggtaagttaaagctacataatacatat 
aatacaattagtttagtattttttgctcctatagttatatatactgcttataattatgga 
gtgtatactacagcactattatggcttggatatgctatagtggggctgataatctggatg 
cctattgtacaccatgtatttgctaaaggtatcaataggtatttttttataaatttagca 
gttattactatagtatgttttttattatcgttaatatttaagggttggtatatttatcca 
agtaaaattgggttggtagaattaatattgattgggtttgcatttttatttatacaaatt 
tgtatagagtatgttttgtttcggtacaaggttttgaggtgtatagatgattaaagtttc 
agtatgtgtgatgacatacaatcaagaaaagtatattggtcaatgtttagagtctttggt 
tactcaagagactgattttgactttgagataatcgttggagatgatttttctacagatgg 
tacaagagatgttattcaagagtatcaaaaaaagtatccggatatcataaagccagtttt 
tagagataagaatgtgggaattactgaaaatattaaagaaatctattttgttgcaaatgg 
tgagtatatagctcatatggatggtgatgattatgcattgcctggtaaacttcaaattca 
ggctgattttttggataataatccaagatgtacgggagtttttcataatataaatatact 
ctatccaaatggtaatatacaacatagtaggtttgcttgttcaaataagagtatattcaa 
tttatcagacactttacgcggagttgctgttggtgcaaatagttcaaaaatgttcagaac 
atcggttttggatgatttgattttaccggatatagagcttctagattattattttcatgt 
tataacagcagaaaaaggttatttaagttttttaaattctaatgaatcctatagtgtgta 
cagaaaaggtattggtatcacatctaagtctaaggaaaaaatctataatacttatgctgg 
attatttgaatattttttggatagatatcctgaagagaaattaaatatttgtatccctgt 
tgtgcaaatgataatttcggctattaaagggagatgttttattagtgctattcgtctatt 
caaaattttaattagatcaagatgtattccattagtaagttggtttaaatatagatttga 
aaaataaatatcatttagaggattatgtgaaatgaagggaataattctagctggtggcag 
tggtacaaggctatatccacttaccttgggtgttagcaaacagctgctacctgtttatga 
caagccattgttatactatccactatctgtgcttatgcttgcaggtattagggagatatt 
aattatctctacagtgcgtgatatctcacttatccaagagcttcttggtgatggttcaca 
atttggtatacagttgagttataaaatccagccatcaccagatgggcttgctcaagcatt 
tattcttggtgaggagtttttggcgggtgactcagcttgtttgatattaggagataatat 
ctactatggtcaaggtatgactacaatgctagagtctgcaagagcacagtgtggaggtcc 
agctggtggcgcttgtgtttttggttattatgttaatgatccgcatagatatggtatagt 
cgaatttgataagcaaaaaaatgtaatttcggtagaggaaaagccacagaatcctaagtc 
acactatgctatcacaggtttatatttttatgataataatgttgttgagtatgctaaaca 
agtcaaaccatctgcacgtggtgagctagagattacttcacttaatgagttatatctaaa 
agaaaataagctaaatgtcgaactcttagggcgtggctttgcttggcttgatgctggtac 
gcatgattcattgctagaggcaggtcaatatgtcgcaactattgagaaaagacaagggct 
taaaattgcatgtttggaagaaattgcatggcgtaaaggctttatctcaacacaacaagt 
tctagctcaagctgaaaaactttctaagacagagtatggtcagtatctgaagaatttaat 
taaggatggtttataaattaatccgtcatacccatgaaggtgggtatctcataaaagttg 
gatatgttttggagattccaatctgcgcagtaatgacaggtttggtaatatatagcgatg 
ttttacaatgactaaaaatggttttatgtatattcttacaaataaggataatactgttct 
gtacatagttgtaacatctaatttgataaaaagaatgtatgagcataaacatagccttgc 
agatggttttactaaaaatataatgttaataagttagtttattttgaaatttatgaagat 
ataaaagcagcaattctgtgagaaaagcagttgaaaaaatgaaacagatcttggaaagaa 
cgaattattaatgagatgaatccgaattggaatgatttatatgaattaatatgtgagtaa 
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Fig.6 (Cont V). 

aacttttgtcttactggtgcagataggtatctctaaatatcagatgtgattgggagatta 
ccgcctacgcggtaatgacaagtttatgcggtaatgatagtttagtgagagaatgactag 
tcactataggaatgatgatgtaatgaggaatgaaaaaatgaactacaaaccaaaaaatat 
cctagtaacaggtgcggcgggatttattggtagtaactatgtgcgtatgatgttatcacg 
ctatagtgatatcaaaataatctcgtatgataagcttacttatgcgggtagtttagataa 
tctaaaagacttgaataatgaacataaccatacttttataaaaggtgatatttgtgatga 
agttttagtatatcaaacactgaaagaatataaaattgatacgatagtacattttgctgc 
agaatcgcatgttgataattcaattgctaatccaaaggtatttttagaaacgaatgtgat 
aggtacatttacacttttagattgtgctaaaaggtattggttagatgagctaggtttaga 
agaaactagttgtaggtttcatcatgtatctactgatgaggtatatggtaccttggcaaa 
agatgaaccagcctttactgagattaaggcttatgagccaaattcaccgtattcggcatc 
taaggcgggatctgatcatatttctagagcatatcatcatacctataaacttccggtaac 
aatttcaaattgttcaaacaactatggaccataccaacatcgagagaaattaatccctgt 
agtgataaatagttgtataaactacaagcctattcctgtttacggagatggttcgaatat 
tcgagattggctatatgtagaagatcactgcgatgctatccagacaattgttgagaaagg 
agtggttggagaggtttataatattggtggtattaatgaagttgataatctaaccttggt 
aaaaactatctgtaaactaatggatgaatataaaccagaaaatgctccacattctaactt 
aatcacatttgtggaagatagaaaaggacatgattggcgttatgctattgataacagcaa 
gattcagaatgagttaggatggaagccatcacaagattttgataagatgtttagacaaac 
tattgagttttatctatagcttaaatatttatcttatgagtatctctaaaaaatcaattt 
aatttatttttgtgttaaaaagtagtgttcgcaagaatatagttaatccgaaagatattt 
gtagaaaaagatatttgtagaaatgttataatgtctaataaaa 
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